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Preface 



“What joy to discern the minute in infinity, the vast to perceive in 
the small, what divinity!” 

Jacob Bernoulli (1654-1705) in Ars Conjectandi (1713) 

We are proud to present to you the proceedings of the Sixth International Con- 
ference on Cellular Automata for Research and Industry (ACRI 2004), held in 
Amsterdam, The Netherlands on October 25-27, 2004. 

Since the first conference in Italy, ACRI, which is held biennially, has become 
the premier conference in the field of cellular automata in Europe and beyond, 
and is still growing in quality and size. 

This year’s theme was “From Individual to Collective Behavior” , emphasiz- 
ing the capability of Cellular Automata to simulate macroscopic processes from 
individual, local interactions. Cellular Automata, in spite of their apparent sim- 
plicity, represent a very powerful approach to studying spatio-temporal systems 
in which complex phenomena build up out of many simple local interactions. In 
the words of Richard Feynman in the Character of Physical Law (1982), “Nature 
uses only the longest threads to weave her patterns, so each small piece of her 
fabric reveals the organization of the entire tapestry” . 

John von Neumann, who is recognized as the father of cellular automata, 
would have been 100 years old in 2004. ACRI 2004 wanted to commemorate this 
date by inviting researchers to submit contributions related to von Neumann’s 
work or to the emergence of organization in systems in which collaboration 
between components wins over the individual behavior. 

In view of this commemoration we had two very inspiring memorial ple- 
nary lectures on the first day: “Von Neumann’s Century: Too many souls!” by 
Prof. Tomasso Toffoli and “John von Neumann and Cellular Automata” by Prof. 
Roland Vollmar 

Other invited lectures that were presented in the plenary sessions during the 
three meeting days were: “Pattern Discovery and Automated Theory Building” 
by Prof. James P. Crutchfield, “Studying Biological Development and Evolution 
with Multilevel Particle Systems ” by Prof. Paulien Hogeweg, “Cell Scale Simu- 
lations, the Neglected Link Between Microscopic and Continuum Modeling ” by 
Prof. James A. Glazier, “From Cellidar Automata to Wetware ” by Prof. Andrew 
Adamatzky, and “Structural Design and Optimization Using Cellular Automata” 
by Prof. Zafer Giirdal. 

We would like to express our sincere thanks to the invited speakers who 
delivered such inspiring lectures at ACRI 2004. 

The conference was organized along the following tracks: 

— Methods and Theory 

— Evolved CA 

— Traffic, Networks and Communication 
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Preface 



— Applications in Science and Engineering 

— Bio-medical Applications 

— Natural Phenomena and Ecology 

— Social and Economical Applications 

This volume contains peer reviewed original work on the theory and applica- 
tion of Cellular Automata. After peer review by three experts in the field, 40% of 
the 150 papers submitted were selected for oral presentation and 30% for poster 
presentation. A total of 30% of the submitted papers were rejected. 

This conference would not have been possible without the support of many 
people and organizations that helped in different ways to make it a success. 

First of all we would like to thank the authors for making the effort to sub- 
mit so many high-quality papers. We thank the Program Committee for their 
excellent job in reviewing the submissions and thus guaranteeing the quality of 
the conference and the proceedings. We thank Liesbeth Otte and the conference 
office of the University of Amsterdam for their practical assistance and support. 
Many thanks go to Coco van der Hoeven for her secretarial work. Dick van Al- 
bada, Berry Vermolen and Jiangjun Cui are acknowledged for their punctuality 
in preparing the draft of the proceedings. 

We thank our sponsors for their financial support: the board of the University 
of Amsterdam, the Science Faculty and the Institute for Informatics. Finally we 
thank the Dutch Science Foundation NWO, section Exact-Sciences, as well as 
the section Computational Life Sciences. 
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Abstract. This article describes the addition to the von Neumann cel- 
lular automaton of the Tom Thumb Algorithm, a mechanism developed 
for the self-replication of multi-processor systems. Except for the cell 
construction process, every functionality of the original CA has been 
preserved in our new system. Moreover, the Tom Thumb Algorithm now 
allows the replication of any structure within the von Neumann environ- 
ment, whatever its number of cells may be. 



1 Introduction 

In the middle of the last century, John von Neumann was interested in the 
concept of self-replication. During his research, he presented a model for an Uni- 
versal Constructor (UC) system based on a two-dimensional cellular automaton 
of 29 states [1], This UC is divided in two different parts: a tape that contains 
the description of the cellular machine to construct and the constructor in itself 
that reads the tape and builds the new corresponding machine with a dedicated 
cellular arm. Moreover, if the tape contains the description of the constructor 
and if the constructor, in addition to its building abilities, can also copy the tape 
information, then the resulting system is a self-replicating one (figure 1). 

In spite of the conceptual success of von Neumann’s work, the realization of 
such a system encounters a major problem: with the proposed architecture, the 
UC in itself needs about 200’000 cells, while the tape containing its description 
is almost five time greater. Clearly, it is too big for a physical implementation 
(at least today) and it is a real challenge even for a software simulation. In fact, 
the UC has not yet been completely specified [2]. 

In 1995, Umberto Pesavento presented an extension of the transition rules of 
the standard system [3]. Thus, he drastically reduced the number of cells used in 
the UC replication and proved the validity of the von Neumann self-replication 
process by simulating the entire UC. Despite that, once again, there were too 
many cells for a physical implementation. To solve this problem, we decided to 
apply to the standard von Neumann system a new kind of self-replicating process 
that uses a smaller number of cells: the Tom Thumb Algorithm (TTA). 

In section 2, we will briefly present the standard von Neumann system. The 
basic operation of the Tom Thumb Algorithm will be exposed in section 3. 
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Fig. 1 . Universal Constructor self-replication 



Section 4 will then introduce the system we have developed for self-replication 
in the von Neumann environment and in section 5, we will discuss of some 
potentialities of such a realization from an evolutionary point of view. Finally, 
we will briefly mention the hardware implementation of our system in section 6. 



2 Von Neumann States and Transitions 

As mentioned before, each cell of the von Neumann CA possesses 29 states. 
These states can be divided into several different groups with regard to their 
” functionality” . 

The first state is the quiescent one. A cell in this state has no influence on its 
neighborhood. Every unused cell of the cellular array is in the quiescent state. 

Then we find the 16 states responsible for the propagation of excitations 
through the cellular array. Each of these states has a direction, i.e. one of the 
four cardinal ones, and can be excited or not. Moreover there are two kinds of 
transmission states that propagate different types of excitations: the ordinary 
and the special ones. An ordinary (respectively special) transmission state be- 
comes excited if it receives an ordinary (respectively special) or a confluent (see 
below) excitation on one of its inputs. Moreover, they introduce a delay of one 
time unit in the propagation of the excitations and act as OR gates. 

A third group contains the confluent states. These are also used for the 
transmission of excitations. A confluent cell becomes excited only if all of the 
neighboring ordinary transmission states directed to it are excited. It conse- 
quently acts as an AND gate. The confluent states are also used to convert an 
ordinary excitation into a special one and they permit the splitting of a trans- 
mission line, acting as fan-outs. Moreover, they introduce a two time units delay 
in the propagation of information. Finally, they act as memory units: when none 
of its neighbours is an outgoing transmission state, if the confluent has been 
previously excited, it keeps the excitation until it can release this latter through 
a valid transmission state. 

Pesavento’s extension of the standard rules provides another type of confluent 
state that permits the crossing of excitations. 

The last group contains the sensitive states that are transitory ones and 
permit the construction of each specific cell type: when a quiescent state receives 
an excitation, it goes in the first sensitive state. Then, its state changes at each 
time unit as seen in figure 2. 
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Fig. 2. Sensitive states and construction process 



To the construction process corresponds a destructive one: the transition 
from any state to the quiescent one. When a cell is in an ordinary transmission 
or in a confluent state and receives a special excitation, it is destructed and 
becomes quiescent again. The same thing happens when a special transmission 
state receives an ordinary excitation. 

3 Tom Thumb Algorithm 

To find an alternative self-replication mechanism for the UC, we define the entire 
constructor as a two-dimensional organism composed of multiple cells. Each cell 
is defined by two genes: the first one corresponds to the cell’s state according to 
von Neumann while the second, as we will see, will be used to direct the con- 
struction and replication processes. Using these definitions, we can then apply to 
the machine a self-replication mechanism known as the Tom Thumb Algorithm 
(TTA) [4,5]. 

Without loss of generality, we will present the TTA construction process of 
a four-cell organism defined by its genome showed at the top of figure 3. The 
numbers correspond to von Neumann states, while the other genes define the 
directions of the construction path (arrows) as well as the branching directions 
for the self-replication process (background color of arrows in the figure). 

Moreover, although defined by only two genes, each cell of the organism 
has four memory positions: the two at the right in figure 3 are used to store 
the cell configuration, i.e. its state and the construction information, while the 
two remaining ones keep a copy of the genome indefinitely moving around the 
organism. 

The construction process proceeds as follows: when the genome is injected in 
the cellular array, the first cell receiving it shifts the incoming genes until it has 
stored its configuration in the corresponding registers (t=4 in figure 3). At that 
time, the cell knows where to forward the next incoming genome information 
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Fig. 3. Genome and first steps of the TTA creation of a four-cell organism 



(to the North for the cell of the figure). The next genes arrive in the upper cell 
which will also shift them to get its configuration and find the way to forward the 
remaining genes (East, t=8). This process then repeats itself until the organism 
is fully constructed, i.e. at time t= 16 in the figure. 

Note that twice the whole genome has been injected for the organism con- 
struction: one half of the genes is fixed in the configuration registers while the 
other half is continuously shifted around the organism, following the construction 
path. 

The self-replication process is started when the first gene of the genome ar- 
rives in a cell whose configuration allows branching. In our example, at time 
t=12, a replication path is consequently initiated to the North. The running 
genome is duplicated and sent through the replication path. This process obvi- 
ously starts the construction of a second organism whose genome is the same as 
for the first one: the organism replicates itself. 

Note that when the genome has been transmitted twice through the replica- 
tion path, the construction of its copy is finished and the branching replication 
path can be removed. Note also that the construction path of a valid organism 
has to be a loop; if it were not, the copy of the genome would not have been 
able to run around the organism and the replication process would have failed. 



4 TTA and UC 

Our main goal was to add to the von Neumann system an innovative process, 
in our case a modified version of the TTA, that could drastically reduce the 
number of cells used for the self-replication. Then, the resulting system had to 
keep every standard functionality of the original von Neumann CA, as well as 
implementing the Pesavento crossing ability. Consequently, we have separated 
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our system design in two different layers and we added a new cell type to the 
system: the activator , used to launch the self-replication process. 

The first layer (UClayer) contains the von Neumann standard information 
coded on four bits. The two first bits determine the cell type (quiescent 00, or- 
dinary arrow 01, special arrow 10, and confluent 11). The two remaining ones 
define the arrow direction or the confluent subtype (memory, crossing or activa- 
tor). We show in figure 4 the different UC layer codes and their corresponding 
cell type. 



Type 


Dir 


Cell symbol 




00 
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01 


00 
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ordinary North arrow 


01 


01 


-»• 


ordinary East arrow 
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ordinary West arrow 
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01 
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special East arrow 


10 
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special South arrow 
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11 
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00 
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not used 


11 


01 


R 


activator confluent 


11 


10 


c+ 


crossing confluent 


11 


11 


Gxx 


memory confluent 



Fig. 4. UC layer codes 



The second layer (TTAlayer) is only used for the construction and replication 
processes. It’s also defined with four bits: the first two give the direction of the 
replication path, the third bit is a flag that says if the cell is the first one of the 
organism and, finally, the last bit corresponds to the end-of-organism cell flag. 

Moreover, in our TTA version, the path does not have to be a loop, as in the 
standard algorithm: in our case, the path begins at the start-of-organism cell 
and then goes through the entire organism, from cell to cell, until it reaches the 
end-of-organism cell. 



4.1 Cell Construction Process 

As we have just seen, a TTA-UC cell is fully defined by the eight configuration 
bits of the two layers. This fact did not obviously permit us to keep the same 
construction process as in the original von Neumann system, but we maintained 
the same general idea: a quiescent cell is transformed in another cell when it 
receives the corresponding (ordinary or special) excitation. To implement the 
construction process, as shown in figure 5, we added three flags (with the grey 
background) to the eight configuration bits. 
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Fig. 5. The different construction pulses 



Reading the bits from left to right, the first flag bit is used to start the cell 
construction process and is always equal to ’1’. Then we find the UClayer type 
on two bits (aa in figure 5), followed by the UClayer direction, on two bits too 
(bb). 

After these, we have the second flag bit: if it is equal to ’1’, the cell construc- 
tion process stops. In such a case, the cell has updated its UClayer state, but 
without modifying its TTAlayer configuration. This could be used to create a 
new self-replicating organism on a pre-existing TTAlayer, or just to change some 
cells in a TTA-UC structure without affecting its self-replication ability. 

In the other case, i.e. if the second flag is equal to ’O’, the construction process 
continues: the two next bits update the TTAlayer direction (cc) and the last two 
define the TTAlayer type (dd), i.e. start or end of organism. 

At the end, we find the last flag bit. While it is equal to T’, it makes the 
cell wait for the entire organism to be constructed, before being able to act 
like an usual von Neumann cell. During that time, the cell only forwards every 
excitation it receives in the direction defined by its TTAlayer. 

A cell can then be constructed using a pulse of six excitations, as long as 
we want to keep the TTAlayer unchanged. Otherwise, the construction pulse 
needs eleven excitations, with the last being inactive for a single cell creation 
and active in the case of an organism self-replication. 



4.2 Organism Self-Replication 

We will now present the self-replication process of a simple organism. For clarity, 
we will take as example the replication of a two by two cells organism. The 
two layers of the organism and the configuration bits of its cells (without the 
construction flags) are shown in figure 6. The start-of-organism cell is in dark 
grey while the end-of-organism one is in light grey. 
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Fig. 6. Two layers and configuration bits of the organism 
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We will first present the emission of the organism genome and then, show 
how this genome is used in the construction of the new replicated organism. 
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Fig. 7. Some steps of the TTA-UC genome emission 



Genome emission: An organism will start its replication process if one of 

the neighbours of the start-of-organism cell is an excited activator cell (R in 
figure 7 at time t=0). In such a case, this latter sends a StartReplication message 
(SR in the figure); the start-of-organism cell then emits to its TTA direction a 
InternalReplication message (IR) at time t=l. This latter will then be forwarded 
along the TTA path through the entire organism until it reaches its last cell, 
making every cell of the organism become in a TTA forwarding state: from that 
time, they transmit to their TTA direction every excitation they receive. 

Just after the InternalReplication message, the start-of-organism cell sends its 
configuration bits with the appropriate flags during the following 11 clock steps. 
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As for the first message, these configuration bits are then forwarded through the 
TTA path until they reach the end-of-organism cell. This latter transmit the 
configuration pulse to the UC layer of the next cell in its TTA direction. 

When the first cell has finished sending its configuration stream, it emits once 
again a InternalReplication message (t=13) meaning it has finished its replica- 
tion task. When the next cell on the TTA path receives this InternalReplication 
message, it goes out of the TTA forwarding state and begin the emission of its 
own configuration bits. 

This process repeats itself in a similar way until the end-of-organism cell 
has transmitted its last configuration bit: at that time (t=49), the organism has 
finished its genome emission. 
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Fig. 8. Some steps of the TTA-UC replicate construction 



Construction: We said that the end-of-organism cell transmits the configu- 

ration pulses to the UC layer. The excitations then consequently follow the UC 
layer specifications: they can travel along an ordinary arrows path, if any, until 
they reach a quiescent cell to configure. In figure 8, that happens at time t=5. 
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When a quiescent cell has received eleven building excitations, it is fully 
configured. But, as we have seen, the last configuration bit has to be chosen 
equal to ’1’ in the self-replication process. This flag implies that the cell must 
stay in a TTA forwarding state until the end of the organism construction. 
Indeed, it has to transmit the next received excitations to its TTA direction in 
order to pursue the creation of the next organism cells, in the good direction. 

When the last configuration bit has reached the new end-of-organism cell 
(t=51), this latter sends an InternalReplication message back along the TTA 
path. This message, that notifies the end of the organism construction is for- 
warded back by each cell of the organism until it reaches the new start-of- 
organism cell, at t=54 in the figure. 

The next time unit, the start-of-organism cell becomes excited in its UC 
layer and can launch the classical UC behaviour of the new organism: the self- 
replication process is terminated. 



5 TTA-UC and Evolution 

As explained before, and unlike the standard Tom Thumb Algorithm, our TTA- 
UC system does not possess a running genome that keeps unchanged the initial 
configuration of the organism. In fact, our replication system works by self- 
inspection [6]: the organism generates the bits configuration that permits to 
copy its current state (and not the initial one). 

Having such an ability should permit a TTA-UC organism to evolve through 
its replication process: for example if a cell of the organism has got its UCconfig- 
uration modified during its normal behavior, without changing its TTAlayer 
information, then the organism can still replicate itself but will also copy the 
modified cell. As a result, this new organism will not be a perfect copy of the 
initial one: almost every UClayer modification can propagate through the or- 
ganism generations. An organism population could consequently evolve by the 
integration of the acquired modifications into the organism configuration of the 
’new generations’. 



6 Hardware Implementation 

The hardware implementation of the TTA-UC uses the BioWall, a two- dimen- 
sional electronic wall designed for bio-inspired applications [7]. In order to test 
the concept, we have realized a simple von Neumann pulser composed of about 
thirty cells. We were able to verify that the pulser can self-replicate in addition 
to realizing its usual functionality. 

Moreover, due to the huge number of states the TTA-UC design would have 
used for a standard CA implementation, we realized the system following the 
Data and Signal Cellular Automaton (DSCA) concept [8,9]. Such a choice per- 
mited us to design the TTA-UC almost intuitively and to make the design fit 
the limited place at disposal. 
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7 Conclusion 

The designed system comes from the unification of the Tom Thumb Algorithm 
with the standard von Neumann architecture. Its major quality is the great 
diminution in the number of cells that an organism needs for its self-replication. 
In fact, with this new TTA-UC system, any von Neumann organism can now be 
configured in order to possess the self-replication ability. As a result, the concept 
of Universal Constructor in itself becomes obsolete. 

Moreover, we believe that our TTA-UC system is able to emulate all of the 
precedent von Neumann functionalities. A further interesting work could be to 
confirm that a Turing Machine can be successfully emulated by our new TTA- 
UC architecture. Additionally, one should be able to realize this implementation 
with only small modifications to the original design of the von Neumann Turing 
Machine, due to the differences in the cell construction process. 
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Abstract. Standard Cellular Automata (CA) are memoryless: i.e., the 
new state of a cell depends on the neighborhood configuration only at the 
preceding time step. This article considers an extension to the standard 
framework of CA by implementing memory capabilities in cells. Thus 
in CA with memory : while the update rules remain unaltered, historic 
memory of past iterations is retained by featuring each cell by a summary 
of all its past states. A study is made of the effect of historic memory on 
two given sides of the hypercube of elementary probabilistic CA. 



1 Introduction 



Cellular Automata (CA) are discrete, spatially extended dynamic systems com- 
posed of adjacent cells or sites arranged as a regular lattice, which evolve in 
discrete time steps. Each cell is characterized by an internal state whose value 
belongs to a finite set. The updating of these states is made simultaneously ac- 
cording to a common local transition rule involving only a neighborhood of each 
cell. Thus, if is taken to denote the value of cell i at time step T, the site 
values evolve by iteration of the mapping : cr.[ T+l 1 = (/>(ov £ A/)) , where (j> 

specifies the CA rule operating on the neighborhood (A f) of the cell i. 

This paper deals with elementary rules : one-dimensional CA with two pos- 
sible values ( k = 2) at each site (a £ {0,1}), with rules operating on near- 
est neighbors. Following Wolfram’s notation, these rules are characterized by 
a sequence of binary values {fa associated with each of the eight possible 



triplets (cr-^,cr } - T) 



TD) . 

i+l) ' 



111 110 101 100 Oil 010 001 000 

a a a a a a a a ■ Ihe rule number, 
Pi P2 P3 P4 P 5 P6 P 7 P8 



R = '^^fa2 8 1 , of elementary CA ranges from 0 to 255. We will restrict our con- 

»= l 

siderations to legal rules : rules that are reflection symmetric {fa = fa, fa — fa), 
and quiescent {l = 0) . These restrictions leave 32 possible legal rules. 



P.M.A. Sloot, B. Chopard, and A.G. Hoekstra (Eds.): ACRI 2004, LNCS 3305, pp. 11—20, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 
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2 Memory 



Standard CA are ahistoric (memoryless): i.e., no memory of previous iterations 
except the last one is taken into account to decide the next one. In this paper, 
a variation of the conventional one-dimensional CA is considered, featuring cells 
by a summary state in all the previous time steps . Thus, what is here proposed is 
to maintain the transition rules (</>) unaltered, but make them actuate over cells 
featured by a summary of all their previous states : cr- T+1 ^ = <t>(s^ £ A/)), s- T ' ) 
being a summary state of cell i after time-step T . We refer to these automata 
considering historic memory as historic and to the standard ones as ahistoric. 

A simple way of taking history into account is that of featuring cells by their 
most frequent (mode) state (in case of a tie, the cell is featured by its last state). 
Table 1 shows the spatio-temporal patterns of the legal rules affected by such 
mode memory mechanism when starting from a single site live cell. As long as the 
latest and most frequent states coincide after the two first time-steps, the historic 
and ahistoric evolution patterns are the same till T — 3. But after the third 
time-step, the last and the most frequent states often differ, and consequently 
the patterns for the historic and ahistoric automata typically diverge at T = 4. 
In the ahistoric context, the deterministic Rule 254 (1111110) progresses as fast 
as possible: it assigns a live state to any cell in whose neighborhood there would 
be at least one live cell. Thus, in the ahistoric model, a single site live cell will 
grow monotonically, generating segments with size increasing two units with 
every time step. The dynamics is slower in the historic model (see Table 1) : 
the outer live cells are not featured as live cells until the number of times they 
’’live” is equal to the number of times they were ’’dead”. Then the automaton 
fires two new outer live cells. Rules 94 and 126 evolve much in the punctuated 
equilibrium manner of Rule 254. The effect of memory on Rule 90 (01011010) is 
dramatic: after T = 3 the most frequent state of all the cells is the dead one, so 
Rule 90 dies out at T = 4 . The remaining rules in Table 1 generate period-two 
oscillators as soon as at T=2. 

Historic memory can be weighted by applying a geometric discounting pro- 
cess in which the state a- T ~ T \ obtained r time steps before the last round, is 
actualized to the value: a T a\ (a being the memory factor). This well known 
mechanism fully ponders the last round, and tends to forget the older rounds. 
After the time step T, the weighted mean of the states of a given cell ( i) is : 






r'P + I 



UJ, 



(T) 



T—l 






a 



T-t 



A(T) 



(1) 



The choice of a simulates the long-term or remnant memory effect : the limit 
case a = 1 corresponds to the full memory model just introduced, whereas 
a « 1 intensifies the contribution of the most recent states and diminishes the 
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contribution of the past ones ( short type memory). The choice a = 0 leads to 
the ahistoric model. This memory mechanism is accumulative in their demand 
of knowledge of history : to calculate the term of m| T ' ) it is not necessary to 
know the whole {<r®} series, while sequentially compute: = auj[ T ^ +af\ 

The rounded weighted mean state (s) will be obtained by comparing its 

1 

weighted mean (m) to 0.5, so that: s- T ^ = ) 



if m- T ^ > 0.5 
if = 0.5 



In our 



a i 

0 if m [ T < 0.5 . 
two-states scenario, memory is not effectively implemented unless a > 0.5 ([4]). 

It should be emphasized that the memory mechanism considered here is 
different from that of other CA with memory reported in the literature 1 , which, 
typically, incorporate memory into the transition rule. Particularly interesting 
is the second order in time reversible formulation based on the exclusive OR 
(noted ©) : cr,- T+1 ^ = € A/)^ © er- T_1 B Double memory (in rule and in 

cells) can be implemented as : c ! - T+1 ^ = </>^ s ^ € A © s- T but to preserve 
reversibility, inherent in the XOR operation, the reversible formulation with 
memory must be : cr,| T+1 ^ = 4 , (y s 'f" > e A © cr- T_1 ^ ([4]) . We have studied the 
effect of memory on CA in the deterministic scenario in a previous work ([2] -[8]). 



Table 1. Legal rules affected by mode memory when starting from a single site live 
cell up to T = 8. Live cells: ■ last (cr=l), □ most frequent ( s = 1). 



146,218 178,250 

R 18,90 R 22,54 R 50,122 






□ □ □ □ 



R 94 



R 126 R 150,182 



222 
R 254 



.Him 

'mm 




3b: 

3b: 




3 Probabilistic Cellular Automata with Memory 

So far the CA considered are deterministic. In order to study perturbations 
to deterministic CA as well as transitional changes from one deterministic CA 
to another, it is natural to generalize the deterministic CA framework to the 
probabilistic scenario where the /3j are replaced by probabilities Pi (0 < Pi < 

1 See, for example, [l],p.43; or class MEMO in [9],p.7. The latter shows an example 
of 2D CA with memory, but to the best of our knowledge the study of the effect of 
memory on deterministic CA has been rather neglected. 
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1). Kinzel [10] and Domany and Kinzel [11] pioneered in the study of these 
probabilistic CA (PCA for short). 

As in the deterministic scenario, memory can be embedded in PCA by featur- 
ing cells by a summary of past states Sj instead of by their last state ct,; . A last but 
not least disclaimer must be done of the use of the term memory as entering into 
the probabilistic scenario. We embed historic memory into the characterization of 
cells but not directly in the construction of the stochastic transition rules, so we 
are dealing with probabilities of the generic form P(ct- T+ 1 ^ = l/s^i, s[ T \ s-^) 
and not with some form of memory mechanism (of length r) in which the past 
configurations ( a = {cq,i = 1,...,N}) determine the transition probabilities 

: P(cr[ T+1 ^ = l/er (T \ ed T_1 \ . . . , g^ T ~ T ^) as done in the specialized literature 
when dealing with memory in PCA (e.g. , [12] ) . 

Legal rules are now of the form (pi, P2, P3, Pa, P2, Pe, Pa, 0) ■ The five probabil- 
ities parameterize a five-dimensional lrypercube , where the deterministic rules 
are placed in the 32 corners. Elementary CA quickly evolve into configurations 
with a density of non null states fluctuating about a given mean (p) so the asymp- 
totic density is considered in the literature as a natural order parameter in the 
phase space . In this paper we focus on two particular sides of the lrypercube, 
those defined by the structure of probabilities : i) (0, 0,p 3 , 1, 0,p 6j 1) 0) , with de- 
terministic corners the Rules 18, 22, 50 and 54 , and ii) {p\,p2,Pi,P2,P2, 0 ,P 2 , 0), 
with corners 0, 90, 160 and 250. The former is a very interesting subset as in- 
cluding not only Rule 18 (one of the first rules carefully analyzed) but also the 
complicated (to avoid the coined term complex) Rules 22 and 54. The rules with 
the structure ii) are peripheral 2 and totalistic < 7 J - T+1 ' ) = <p[a[ T { + 0 ^+ 1 ) such as 
Rule 90 : erj T+1 ^ = cr-^ + cr-TJ mod 2 . The subset ii) has been very much stud- 
ied due to the equivalence between peripheral PCA and the problem of directed 
percolation [11]. By the way, Rule 22 in i) is totalistic but not peripheral. 



4 The Simulations 



The simulations in this work were run for 1000 time steps on a lattice consisting 
of 500 sites with periodic boundary conditions. All the simulations start from 
the same random initial configuration, in which the value of each site is initially 
uncorrelated, and is taken to be 0 or 1 with probability 0.5 . All the PCA evolve 
with identical realizations of the stochastic noise. The memory factor a varies 
from 0.5 (ahistoric model) to 1.0 (fully historic) by 0.1 intervals. 

Figure 1 shows the spatio-temporal patterns of four significant deterministic 
rules affected by memory. Figures 2 and 3 show the asymptotic density ( p ) of the 
PCA rules of the two subsets just described, when varying the free probabilities 
(P 3 ,P 6 , and pi ,£> 2 ) from 0 to 1 by 0.02 intervals. The asymptotic density was taken 
as the mean density over the last 100 iterations. Table 2 shows the numerical 

2 Rules with no dependence in <j> on the state of the cell to be updated. Some authors 
feature such rules as rules with no memory 
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values of p of the deterministic rules affected by memory. Memory has no effect 
on in Rules 0, 160, 250 (subset ii)) or has a minimal effect in Rule 50 3 . 



Table 2. Asymptotic densities of the deterministic rules affected by memory 



a = 1.0 0.9 0.8 0.7 0.6 0.5 

RULE 18(00010010) 0.120 0.000 0.000 0.012 0.108 0.250 
RULE 22(00010110) 0.359 0.038 0.026 0.295 0.357 0.356 
RULE 50(00110010) 0.478 0.500 0.500 0.500 0.500 0.500 
RULE 54(00110110) 0.461 0.000 0.020 0.412 0.411 0.478 
RULE 90(01011010) 0.503 0.521 0.497 0.495 0.507 0.500 



Let us first examine the effect of memory on subset i), featured to a great 
extent by the effect on the deterministic corners. History has a dramatic effect 
on Rule 18, as is shown in Fig.l . Even at the low value of a = 0.6, the appear- 
ance of the spatio-temporal pattern fully changes: a number of isolated periodic 
structures are generated, far away from the distinctive inverted triangle world of 
the alristoric pattern. For a = 0.7, the live structures are fewer, advancing the 
extinction found in [0.8, 0.9]. In the fully historic model, only periodic patterns 
of live cells survive (simpler than for a = 0.6). The evolution of the density ac- 
cording to Rule 18 in the alristoric model yields an asymptotic density p ~ 1/4 
(a value reached fairly soon). Historic memory induces a depletion of p , null for 
a = 0.8 and a = 0.9, as quantified in Table 1. The effect of memory on Rules 22 
and 54 is similar. Their spatio-temporal patterns in a = 0.6 and a = 0.7 keep 
the essential feature of the alristoric, although the inverted triangles become en- 
larged and tend to be more sophisticated in their basis. A notable discontinuity 
turns out for both rules ascending in the value of the memory factor: when a is 
in the [0.8, 0.9] interval, extinction or a few periodic structures surviving is found 
for both rules. But unexpectedly, the patterns of the fully historic scenario differ 
markedly from the others, showing a high degree of synchronization. Both rules 
present notably close values of the asymptotic densities in the alristoric and full 
memory scenarios (see Table 2). 

The effect of memory on the deterministic corners of the subset i) in reflected 
in the asymptotic density graphs of Fig. 2. Thus, the inhibition induced by mem- 
ory on Rules 18, 22 and 54 when a is in the [0.8, 0.9] interval is reflected in a clear 
depletion of the corresponding graphs in Fig. 2, in which only the proximities of 
the corner R50 are not depleted. When a varies in the [0.6, 0.7] interval, the p 
value of Rule 54 is not notably altered, thus depletion in the 0.6 and 0.7 graphs 

3 This is unusual but not extremely exceptional in the deterministic scenario. This is 
case of Rules such as 0, 160 or 32 which early evolve to the null state, or, of Rules 
250 and 254 which blacken the space, very soon in the ahistoric model, a little later 
with memory. Rules which serve as filters (Class II), such as Rules 4, 50 72, 76, 200, 
204, 232 and 236 are also minimally affected by memory. When dealing with more 
than two states, memory tends to affect almost every rule ([3]) 
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is restricted only to the R18 corner. The aspect of the full memory graph is fairly 
particular, with a dominant tendency to the null value in proper probabilistic 
rules, but not in the fairly deterministic ones (the sides). 

The evolution of Rule 90 (subset ii)) is typically featured as chaotic in the 
ahistoric scenario. The totalistic Rule 90 shows a much smoother evolution from 
the ahistoric to the historic scenario (Fig.l): no pattern evolves to full extinction 
nor to the preservation of only a few isolated persistent propagating structures 
(solitons). In the ahistoric scenario it is p ~ 1/2 for Rule 90, independently 
of the initial density po ^ 0 . When historic memory is taken into account, p 
varies little from this value (see Table 2). Again, the effect of memory on the 
deterministic corners is reflected in Fig. 3. Thus, the gradual effect of memory 
on Rule 90 is reflected in a smooth effect of memory on the graphs of Fig. 3 

The results found here starting with an initial density po =0.5 regarding the 
asymptotic density, have been found similar when starting from a high initial 
density (0.9) and a low one (0.1). We have studied the effect of memory on 
the damage spreading: the difference between two replicas, which evolve with 
the same dynamics but from different initial configurations, in both subsets. As 
a rule, the effect of memory on the difference patterns mimics that on spatio- 
temporal patterns. 

The smooth effect of memory on the peripheral and totalistic subset ii) has 
been found also for other totalistic but non peripheral subsets, in particular on 
the important PCA subsets studied by Kinzel [10]: {P 2 ,P 2 ,P 2 ,P 4 ,P 2 ,P 4 ,P 2 ,$), 
{Pi,P2,P2,P2,P2,P2,P2,0), and (P4,P2,P2,P4,P2,P4,P2,0)- We have explored the 
effect of memory in a number of different no totalistic subsets. We expect to 
report the results of the effect of memory on these subsets in a subsequent work. 



5 Other Memories, Other Contexts 

A number of alternative average-like memory mechanisms can readily be pro- 

t ' T 

posed by writing the components of (1) as : Lof r " > = E^ , A{T) = 

t= 1 t= 1 

. Thus, the weighting factor can be implemented in an exponential way : 
St = P € IR + (equivalent to a geometric discount model with a = e _/3 ), 

or in an inverse way, so that the older rounds are remembered more than the 
more recent ones, by choosing S t = a 4 ” 1 . Weights such as S t = t c or c 4 , c € IN, al- 
low integer based implementation (a la CA) by comparing 2 o> (T ) to A(T) ([3], [6]). 

Departing from the weighted mean memory scheme, cells can be featured by 

(T) 

the parity with the sum of previous states, or by some neural- like form: s\ ' = 
T 

«(E 8t(rp — 9) . A limited trailing memory can be implemented by keeping 

i= T 

memory of the last r states. Ahistoric and historic models can be combined by 
different types of characterizations of the cells of the neighborhood. 
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Fig. 1. Spatio-temporal patterns of four legal CA rules affected by memory. The simula- 
tion was conducted in a lattice of size 500. The values of sites in the initial configuration 
are chosen at random to be 0 or 1 with probability 1/2 . The pictures show the evolu- 
tion of the central band of 250 sites from the initial configuration up to T = 150.. The 
memory factor a is varied from 1.0 (full memory) to 0.5 (ahistoric) by 0.1 intervals. 
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Fig. 2. The asymptotic density of the legal probabilistic CA rules of the form 
(0, 0,P3, 1, 0,P6, 1, 0) in the scenario of Fig.l. The probabilities P 3 and pe are varied 
from 0 to 1 by 0.02 intervals . The asymptotic density is taken as the mean density 
over the last 100 of 1000 time steps. 



Historic memory can be embedded in continuous CA, in which a ranges in 
IR , by : cy T+l ■* = <&(rn\ ^ 6 A/i). Memory can be implemented in this way in 
Coupled Map Lattices, fuzzy CA (see an example in [7]), or Quantum CA. 
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Fig. 3. The asymptotic density of the legal probabilistic CA rules of the form 
(Pi,P 2 ,Pi,P 2 ,P 2 , 0,p2, 0) in the scenario of Fig. 2. 



We have explored the effect of incorporating memory into discrete dy- 
namical systems : ad T+1 ) = /(ad T )) by means of x^ T+1 ' > = /( m with 

T 

m{T ' > = ATT) E a T t x^ t \ Thus, [3] deals with the canonical example, the lo- 
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gistic map with memory : x < - T+1 ' 1 = m^ 1 " 1 + 3.0m^ T ^(l — m*- 7 ^) . A low level of 
memory (a = 0.1) smooths the characteristic chaotic dynamics of the alristoric 
formulation; higher a values generate periodic behaviour, and from a = 0.4 the 
evolution leads to a fixed point {x = 1) . From chaos to order, by varying not 
the parameter of the model (A = 3.0) but the degree of memory incorporated. 

6 Conclusion 

This paper introduces the kind of probabilistic CA with memory in cells (PCA) 
in the simplest (elementary) scenario by considering two particular sides of the 
lrypercube of elementary PCA. One of the subsets is highly disrupted by memory, 
the effect of memory on the other subset is fairly smooth. The approach here is 
mainly descriptive. A more complete analysis of the PCA is left for future work. 
Phase transitions in particular demand a detailed study. Some critics can argue 
that memory is not in the realm of CA (or even of Dynamic Systems), but we 
believe that the subject is worth to studying. At least CA with memory in cells 
can be considered as a extension of the basic paradigm, useful in modeling. 
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Abstract. Computation- and construction-universality in cellular au- 
tomata (CA), first studied by von Neumann, has attracted steady re- 
search efforts, over the years, most employing synchronous CA. Asyn- 
chronous cellular automata (AC A), though of interest as most interac- 
tions in nature are asynchronous, have not been used for this task, other 
than by the indirect way of simulating a synchronous CA. In this paper, 
we propose a universal constructor on a self-timed cellular automaton 
(STCA), a particular type of ACA, in which cells are divided in four 
partitions, each with four states. 



1 Introduction 

Biological self-reproduction was first investigated formally in terms of von Neu- 
mann’s cellular automaton (CA) capable of universal computation and construc- 
tion [1], Requiring cells with 29 states, this CA implements a universal construc- 
tor able to construct an arbitrary structure on an initially blank space, given 
its blueprint. Follow-ups to this study are the 8-state CA of Codd [2], the 4- 
state CA of Banks [3], and the 3-state CA of Serizawa [4], all computation- and 
construction-universal. These investigations have been limited to synchronous 
systems, while there has been hardly research on asynchronous systems. 

In the context of asynchronous systems, Priese [5] investigated computation 
and construction on a Tlrue system that resembles CA. Nelraniv [6] implemented 
self-reproduction on an asynchronous cellular automaton (ACA) based on the 
method by Nakamura [7], in which a synchronous CA is simulated on an ACA, 
but this technique suffers from overhead in terms of cell states and number of 
transition rules. Lee et al. [8], Adaclri et al. [9], and Peper et al. [10] studied 
computation on ACAs, not by this simulation technique, but by directly embed- 
ding delay-insensitive (DI) circuits on them, a type of asynchronous circuit in 
which signals may be subject to arbitrary delays without this being an obstacle 
to the circuit’s correct operation (e.g. see [10,11]). Peper et al. [12] proposed a 
computation-universal self-timed cellular automaton (STCA), a particular type 
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of ACA, in which the state of each cell is partitioned into four parts, each of which 
corresponds with a neighboring cell. There have been no construction-universal 
ACAs (or STCAs) proposed, however, by the direct method. 

This paper shows that computation- and construction-universality can be 
implemented on an STCA by the direct method, in a similar way as von Neu- 
mann’s construction. Our model employs cells with states encoded by 8 bits, 
and 39 rotation-invariant transition rules. 

This paper is organized as follows. We define self-timed cellular au- 
tomata in Sect. 2, starting with computation-universality, and then move on 
to construction-universality. The construction arm and the tape unit, each part 
of the constructor, are described in Sects. 3 and 4, respectively. Sect. 5 gives the 
basic idea of the universal constructor implementation, and finishes this paper 
with a discussion and conclusion. 



2 Self-Timed Cellular Automata 

A self-timed cellular automaton (STCA) [12] is a two-dimensional asynchronous 
cellular array of identical cells, each of which has a state that is partitioned into 
four parts in one-to-one correspondence with its neighboring cells. For example, 
if each part of a cell state consists of 1 bit, a cell can be in one of 16 states 
encoded by 4 bits, and the cellular space is an array like in Fig. 1. Each cell 
undergoes transitions in accordance with a transition function / that operates 
on the four parts of the cell q n , q e , q s , q w and the nearest part of each of its four 
neighbors p n , p e , p s , p w . The transition function f is defined by 

/ : ( q ni qe,qs,qw,Pn,Pe,Pa,Pw ) (tfntdeitfs’tfwiPniPeiP'siPw)’ W 

where a state symbol to which a prime is attached denotes the new state of a 
partition after update (see Fig. 2). We assume that transition rules on an STCA 
are rotation symmetric, thus each of the rules has four rotated analogues. 

In STCA, transitions of the cells occur at random times, independent of 
each other. Since a transition on a cell may change bits of neighboring cells, 
two neighboring cells undergoing transitions simultaneously may write different 
values in shared bits. To prevent such a situation, we assume that neighboring 
cells never undergo transitions simultaneously. 



2.1 Morita’s Rotary Element 

Morita’s Rotary Element (RE) [13] is a logic element with four input lines 
{n, e, s, w}, four output lines {n 1 , e' , s', w’}, and two states — the H-state and the 
V-state, which are displayed as horizontal and vertical rotation bars respectively 
(see Fig. 3). If a signal comes from a direction parallel to the rotation bar of an 
RE, it passes straight through to the opposite output line, without changing the 
direction of the bar (Fig. 4(a)); if a signal comes from a direction orthogonal to 
the bar, it turns right and rotates the bar by 90 degrees (Fig. 4(b)). It is possible 
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Fig. 1. An example of self-timed cellular 
space. A filled circle denotes a 1-bit, and 
an open circle denotes a 0-bit. 
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Fig. 3. An RE in (a) H-state and an RE 
in (b) V-state. 



Fig. 2. Transition rule in accordance with 
the function /. 




Fig. 4. The operations of an RE: (a) the 
parallel and (b) the orthogonal case. 



to construct a universal Turing machine as a circuit composed of only REs, as 
shown by Morita [13], implying that an RE is a universal logic element. 

An RE can be considered as a kind of a DI circuit. Unlike most Di-circuits 
in practical applications, however, a circuit constructed entirely from REs — 
denoted as RE-circuit — contains at most one signal at a time. 



2.2 STCA Model Embedding the Rotary Element 

Here we explain an STCA model that embeds any arbitrary RE-circuit, whereby 
each part of a cell state takes 2 bits. The transition rules are depicted in Fig. 5. 

f 1 ) □ - □ ( 2 ) GH If < 3 > O - S < 4 > m ~ Q 

(5) M « -» [j^H 

Fig. 5. Transition rules to implement a Rotary Element. The resulting STCA is 
computation-universal. A hlled circle denotes a 1-bit; a 0-bit is not shown. 



A signal is defined by Rule #1, and is called a D-signal. A D-signal is used 
to conduct ordinary computation. Fig. 6 shows how to transmit a D-signal to- 
wards the north on a straight path of continuous cells, all bits of which are 0; 
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transmitting a signal towards the east, the south, and the west are done in a 
similar way, due to the rotation-symmetry of transition rules. 

A few basic signal routers are defined from Rules #2-4: an input /output 
multiplexer ( IOM ), a merge, and an LR-tum for D-signal. An IOM is just a right- 
turn element for a D-signal that works for signals coming from two directions 
(see Fig. 7(a)); it has one input path, one output path, and one bi-directional 
path. An input signal on the input path is output to the bi-directional path (Rule 
#2), and an input signal on the bi-directional path is output to the output path 
(Rule #3). A merge is an operator with two input paths and one output path 
(see Fig. 7(b)), which redirects a D-signal arriving from one of the input paths 
to the output path (Rules #2, 4). An LR-turn for D-signal is an operator with 
two bi-directional paths (see Fig. 7(c)), which redirects a D-signal arriving from 
one path to the other path (Rules #3, 4). These three elements are realized by 
a simple pattern on the STCA, a stable quadruple that consists of four 1-bits 
bonded together (see Fig. 7). 

A rotation bar, realized by a row or a column of two stable quadruples, is 
introduced in analogy to that of RE (see Fig. 3) . A signal coming from a direction 
orthogonal to the bar will turn right and rotate the bar by 90 degrees (Rule #5). 




(a) IOM (b) Merge (c) LR-Turn 

for D-signal 



Fig. 6. Signal transmission 
northwards. The shaded cell 
in each configuration will be 
updated according to Rule #1. 



Fig. 7. The operations of a stable quadruple, as (a) 
an IOM, (b) a merge, and (c) an LR-turn for D- 
signal. Each arrow denotes a signal path, and the 
character D alongside it denotes the signal’s type. 



An RE can be composed of stable quadruples as shown in Fig. 8. In this 
implementation, an input/output path pair on each side of an RE is represented 
by a bi-directional path. By connecting IOMs to the bi-directional paths of the 
RE in Fig. 8, a regular RE like in Fig. 3 can be constructed. Fig. 9 illustrates 
the traces of an RE operating on a signal coming from its input line (a) parallel 
or (b) orthogonal to the rotation bar. The STCA is computation-universal in 
the sense that a universal Turing machine composed of REs (see [13]) can be 
embedded on it. 

A stable quadruple is the smallest element in circuits implemented on the 
STCA, since the basic elements such as an IOM, a merge, and an RE can be con- 
structed from it (see Figs. 7 and 8). In this paper, we assume that any circuit on 
the STCA is represented by a specific configuration of stable quadruples, so that 
a universal constructor can be regarded as a machine that puts stable quadruples 
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Fig. 8. Realization of an RE on the Fig. 9. A signal passing through an RE: (a) 
construction-universal STCA. the parallel and (b) the orthogonal case. 



on desired places. The rules required for this (see Fig. 10) are explained in the 
remaining sections. 
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Fig. 10. The transition rules in the STCA, which are used to realize a construction 
arm and a tape unit. A filled circle denotes a 1-bit and an open circle denotes a 0-bit, 
but most 0-bits are not shown except when required to avoid ambiguities. 



3 The Construction Arm 

The construction arm, a part of the universal constructor, plays the main role 
in construction: it moves a writing head over the cell space, and constructs 
desired structures. In Fig. 11, (a) the initial state and (b) the working state are 
illustrated. The initial state is composed of only stable quadruples. The I/O 
path / is used for initialization of the construction arm; L, R and D are used to 
operate the writing head. The initializing process of the arm is described after 
we describe how it works. 
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Flexible Path 




♦ * r t 

R D R D 



(a) initial (b) working 

Fig. 11 . Configurations of the construction arm in (a) the initial state and (b) the 
working state. There are four I/O paths I , L, R, and D. Each I/O path is represented 
as an solid arrow or a broken arrow, the former denoting it is available in the particular 
(initial or working) state of the arm and the latter denoting it is not. The characters 
alongside each arrow denote what kinds of signals can be transmitted on it. 



A construction arm in the working state (see Fig. 11(b)) can be divided into 
three parts: a flexible path, which is a bi-directional path connecting with the 
other two parts, a signal converter, and a writing head. In the construction arm, 
three kinds of signals, a D-, an R-, and an L-signal, are used to transport infor- 
mation to (from, respectively) the writing head from (to) the signal converter 
through the flexible path. The L-signal and the R-signal are each represented 
by a 1-bit, and transmitted similarly to a D-signal, according to Rules #6 and 
#7, respectively (see Fig. 10). If a D-signal is input to the I/O path L, R , or 
D , it is converted into, respectively, an L-signal, an R-signal, or just redirected 
as is, after which the L-, R-, or D-signal is fed to the writing head through the 
flexible path. If an L-, an R-, or a D-signal is returned from the head, a D-signal 
is output to the path L, R, or D , respectively. 

On a flexible path, an LR-turn for all signals defined by Rules #8-13 (see 
Fig. 10) is used to change the direction of a path transmitting a D-, an L-, or an 
R-signal. Unlike an LR-turn for D-signal (Fig. 7(c)), this LR-turn is represented 
by two stable quadruples, and turns all kinds of signals as well as the D-signal 
(see Fig. 12(a)). 

A signal converter is represented by the combination of a D-LR multiplexer 
and a D-LR converter (see Figs. 11(b) and 12(b), (c)). A D-LR multiplexer 
(Fig. 12(b)) is also an LR-turn for D-signal (Fig. 7(c)), but an L/R.-signal arriving 
from the left path (the right path, respectively) is unaffected by it, and is just 
output to the right path (the left path) (see Rules #14-17). A D-LR converter 
(Fig. 12(c)) is an operator that converts a D-signal arriving from the upper path 
(or the lower path, respectively) into an L-signal (or an R.-signal), and outputs 
it to the right path (see Rules #18, 19). Conversely, an L-signal (or an R-signal, 
respectively) arriving from the right path is converted into a D-signal and is 
output to the upper path (or the lower path) (see Rules #20, 21). 

A writing head (Fig. 12(d)) can put stable quadruples on its reach. If a D- 
signal is fed to the writing head, it creates a stable quadruple and returns a D- 
signal (see Fig. 13(a)i and Rule #22). With more D-signal(s), the writing head 
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(a)LR-Turn (b) D-LR Multiplexer (c) D-LR Converter (d) Writing Head 

Fig. 12. Elements used for the construction arm: (a)-(c) are stable patterns, but (d) 
a writing head can be moved by an input signal. 



creates any combination of stable quadruple(s) on its reach (see Fig. 13(a)ii, iii 
and Rules #23, 24). If an L-signal is fed to the writing head, the arm is extended, 
and an L-signal is returned (see Fig. 13(b)i and Rules #25, 26); to change the 
direction of the arm to the left (respectively, the right), a D-signal (respectively, 
two D-signals) and an L-signal are subsequently fed to the head (see Fig. 13(a)i, 
ii, (b)ii, iii, and Rules #26-28). This allows the flexible path to form an arbitrary 
shape without crossing. Feeding an R.-signal to the head results in the withdrawal 
of the arm, and an R.-signal is returned; any configurations written by the head 
on its reach will be kept intact, however (see Fig. 13(c) and Rules #29-35). 



(a) 



D. 



23 



(b) 



25,26, 




27,26, 




(c) 









28,26, 

11 





Fig. 13. The operations of a writing head: (a) creation of stable quadruples, (b) ex- 
tension of the arm, and (c) withdrawal of the arm. Each thin arrow represents an 
input/output signal. Integers or * above each thick arrow denote the transition rules, 
where * £ {29,31,32,33}. In (c) withdrawal, the shaded bits can be all 0-bits or can 
contain one or two stable quadruple(s). 



In the initial state of the construction arm (see Fig. 11(a)), a D-signal first 
has to be input to the I/O path J, creating a D-LR converter from three stable 
quadruples (see Fig. 14 and Rule #36). As the path / is only used to initialize the 
writing head, we leave out I from the figures in the following. After initialization, 
a D-signal is sent to L to create the head (see Fig. 15(a) and Rule #37). I/O paths 
D and R are not available until the head is created. Destruction of the writing 
head is shown in Fig. 15(b): unlike the usual withdrawal, an L-signal is returned 
due to Rule #38. To rewind the extended arm, it suffices to continuously give 
an R.-signal to the head until an L-signal is returned from it, which indicates 
that the rewind is completed. 
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(a) 



37,26, 

15 



L 


1 •• 


j 




;* 





(b) 



R 




16,29, 

38 




Fig. 14. Creation of a D-LR Fig. 15. (a) Creation and (b) destruction of a writing 

converter from three stable head. 

quadruples. 



An example of constructing a construction arm is shown in Fig. 16. For this 
construction, the following command sequence is used: 

LLDLLLDDLLLDDRRDDDRRDDLLLDDRRDD, (2) 

where each character means giving a signal to the corresponding I/O path. The 
construction arm should be rewound after construction, but this is not shown. It 
is clear that the arm can construct every structure composed of stable quadruples 
in this way, as long as an appropriate command sequence is given. 





Fig. 16. Example of constructing a construction arm (Fig. 11(a)) by the command 
sequence in (2). The LHS represents the configuration in which the first command L 
has already been executed. 



4 The Tape Unit 

The tape unit operates the cursor on the tape, and conducts reading/writing 
operations on it (see Fig. 17). A tape is represented as an infinite series of bits, 
each of which is represented by the presence (1-bit) or absent (0-bit) of a stable 
quadruple. The tape unit resembles the construction arm: the cursor is the same 
as the writing head, and moves forward and backward by an L- and an R- 
signal, respectively (see Fig. 13(b)i, (c)i). Note that two L-signals are needed for 
initialization, i.e., to set the cursor on the left end of the tape ( zero position). 

The reading operation is conducted by executing the command sequence 
DL. Fig. 18 shows the situation after processing the D-signal. When an L-signal 
is input, the bit pointed to by the cursor is read and erased: return of an L- 
signal (respectively, an R.-signal) corresponds to a 0-bit (a 1-bit). Reading a 
1-bit like in Fig. 18(b) requires a new rule (Rule # 39). To return the cursor 
to the proper state in which it can conduct its next operation, it needs to be 
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L 0 12 3 
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Fig. 17. The tape unit in (a) the initial state and (b) the working state. The presence 
or absence of a stable quadruple on the tape represents a 1-bit or a 0-bit, respectively. 
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27 , 26 , 
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(a) 0-bit 





Fig. 18. Reading operation of the cursor by the command sequence DL , when the bit 
pointed to is (a) a 0-bit or (b) a 1-bit, resulting in an L- or R-signal, respectively. In 
(b), the bit is cleared after reading. 



Control Unit 



Construction Arm 



Begin 
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1 1* 1 ■n I 



Tape Unit 

Fig. 19. Implementation of universal constructor on the STCA. 



recovered by inputting the sequence R or DR to it (see Fig. 13(a)i, (c)ii). This 
has the (desired) side effect that a 0-bit or a 1-bit, respectively, is rewritten to 
the tape position. 

5 Discussion and Conclusion 

In Sects. 3 and 4, we showed the construction arm and the tape unit implemented 
on the STCA. By connecting the above two parts and their control unit that 
interprets the tape contents and sends appropriate commands to the appropriate 
paths of the construction arm, it is possible to construct a universal constructor. 
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Note that the control unit is a finite state machine, so it can be realized by 
a RE-circuit (see Sect. 2) in a similar way as in [13]. The implementation of a 
universal constructor is shown in Fig. 19, in which the construction arm and the 
tape unit are each equipped with a microcontroller realized by a RE-circuit to 
interface with the control unit, but we cannot describe this implementation in 
detail due to a lack of space (see the extended version of this paper [14] for 
details). 

The construction-universal STCA presented in this paper requires only 39 
rules, and the required configuration is compact, witness the implementation in 
Fig. 19. 
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Abstract. In this paper, we investigate the study of the sequences gen- 
erated by maximum- length 90/150 cellular automata(CA) sequences. We 
propose a method computing the relative phase shifts of all pseudo-noise 
sequences which have the same characteristic polynomial. As a special 
case of this method, we give an algorithm to compute relative phase 
shifts between stages of a CA. This is a very simple and efficient method 
than the already proposed methods. 



1 Introduction 

Cellular automata(CA) are mathematical idealizations of physical systems in 
which space and time are discrete, and each cell can assume the value either 0 
or 1. The cells evolve in discrete time steps according to some deterministic rule 
that depends only on logical neighborhood. For the simplest case, Wolfram [13] 
suggested the use of a simple two-state, 3-neighborlrood one-dimensional CA 
with cells arranged linearly in one dimension. Each cell is essentially comprised 
of a memory element and a combinatorial logic that generates the next-state of 
the cell from the present-state of its neighboring cells(left, right and self). The 
CA-based scheme for generation of pseudoexlraustive patterns has reported in 
[1]- [2] and so on. The phase shift analysis of CA([3], [4], [13]), based on 90/150 
matrices whose characteristic polynomials are primitive, has been investigated 
by Bardell [1]. He calculated the phase shifts between the output sequences 
generated by different stages of a maximum- length 90/150 CA by using discrete 
logarithms of a binary polynomial. Nandi and Clraudhuri [9] proposed a method 
for the study of phase shift analysis based on matrix algebra. They showed that 
every cell position of a maximum-length 90/150 CA generates the same pseudo- 
noise sequence corresponding to the characteristic polynomial of the CA with a 
phase shift. 

Recently, Sarkar [10] gave an algorithm to compute phase shifts. This was 
achieved by developing the proper algebraic framework for the study of CA se- 
quences. In this paper, we study the sequences generated by maximum-length 

* This work was supported by KOSEF:R01-2003-000-10663-0. 
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90/150 CA sequences. And we apply these to phase shifting of sequences gen- 
erated by a maximum-length 90/150 CA. From these applications we give an 
improved method to compute phase shifts, which is different from those meth- 
ods of Bardell’s [1], Nandi and Chaudhuri’s [9], and Sarkar’s [10]. 

2 Preliminaries 

A CA consists of a number of interconnected cells arranged spatially in a regular 
manner, where the state transition of each cell depends on the states of its 
neighbors. The CA structure investigated by Wolfram [13] can be viewed as a 
discrete lattice of sites (cells), where each cell can assume the value either 0 
or 1. The next-state of a cell is assumed to depend on itself and on its two 
neighbors (3-neiglrborhood dependency). If the next-state function of a cell is 
expressed in the form of a truth table, then the decimal equivalent of the output 
is conventionally called the rule number for the cell. 



Neighborhood state 


111 110 101 100 Oil 010 001 000 




Next state 
Next state 


01011010 

10010110 


rule 90 
rule 150 



The top row gives all eight possible states of the three neighboring cells (the 
left neighbor of the ith cell, the 1th cell itself, and its right neighbor) at the time 
of instant t. The second and third rows give the corresponding states of the ith 
cell at the time of instant t + 1 for two illustrative CA rules. On minimization, 
the truth tables for the rules 90 and 150 result in the following logic functions, 
where ® denotes XOR logic and q\ denotes the state of the ztlr CA cell at the 
time of instant t, q\_ t and q\ +1 refer to the state of its left and right neighbors; 

rule 90: q t j +1 = q\_ x ® q\ +1 

rule 150 : q , [ +1 = q\_ x ® q\ ® q\ +1 

Let 1F2 be the Galois field with cardinality 2 and B^x] the set of all polyno- 
mials whose coefficients are members of IF 2 . 

Definition 1. [6] Let f(x) = c 0 + C\X + • • • + c n _iX™ _1 + x n be an n-degree 
primitive polynomial, where Co, Ci, ■ ■ ■ , c„_ 1 £ 1F2- Then f(x) generates a peri- 
odic sequence whose period is 2” — 1. This sequence is called a pseudo-noise(PN) 
sequence. 

The following definitions and theorem are in [7]. 

Definition 2. A sequence {st} is said to be a kth order homogeneous linear 
recurring sequence if 



st+k — cq St + ciSt+i + • • • + Ck-iSt+k-i for t — 0, 1, • • • , (*) 





Computing Phase Shifts 



33 



where Co, Ci, • • • , Cfc-i £ IF2. The characteristic polynomial for the sequence de- 
fined in (*) is defined to be the following polynomial in IF 2 [tc] ; 

f(x) = C 0 + C\X J \ 1- c fc _ xx k ^ 4 + x k . 

Denote the set of all kth order homogeneous linear recurring sequences whose 
characteristic polynomials are /( x) by f2(f(x)). 

Definition 3. A homogeneous linear recurring sequence in 1F2 whose charac- 
teristic polynomial is a primitive polynomial over 1F2 and which has a nonzero 
initial state vector is called a maximum-length sequence in FV 

Maximum-length sequences are of fundamental importance in computer sci- 
ence, cryptology and engineering. 

Theorem 1. Let f(x) be a k-degree polynomial over IF 2 and {st} £ f2(f(x)). 
Then {s*} has the maximum-length 2 k — 1 if and only if f(x) is primitive over 

F 2 . 

3 Computing Phase Shifts 

In this section, we study the sequences generated by maximum-length 90/150 
CA sequences. Each cell position of the CA generates PN sequences [9] . 

Consider the following n x n tridiagonal matrix T n 





(a x 


1 


0 


0 


0 ••• 


... 0 


°\ 




1 


a-i 


1 


0 


0 ••• 


... 0 


0 


T n = 


0 


1 


03 


1 


0 ••• 


... 0 


0 




\ 0 


0 


0 


0 


0 ••• 


... 1 


CLn J 



where ai, 02, • • • , a„ £ IF2. We call T n the state transition matrix for an n-cell 
90/150 CA. Hereafter we write the matrix T n by T n =< ax, 02, • • • , a n >. 

For example, if T4 =< 0, 1,0,1 > is the state transition matrix for a given 
4-cell 90/150 CA C, then the characteristic polynomial is f(x) = x 4 + x + 1, 
which is primitive. The following figure shows the structure of C. 

Tezuka and Fushimi [12] asserted that for a given primitive polynomial /( x), 
there exist exactly two maximum- length 90/150 CA whose characteristic poly- 
nomials are f(x). If T n =< ax, <22, • • • , a n > is a state transition matrix corre- 
sponding to f(x), then the other is T' n —< a n , a n _ 1, • • • , aq >. 

For example, let f(x) = x 4 + x + 1. Then T4 =< 0,1,0, 1 > and T4 =< 
1,0, 1,0 > are state transition matrices corresponding to f(x). 

For any n-cell 90/150 CA whose state transition matrix is T n , the minimal 
polynomial for T n is the same as the characteristic polynomial for T n ([ll]). 
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Fig. 1. The structure of 90/150 CA C with T 4 



Theorem 2. Let T n be the state transition matrix of an n-cell maximum-length 
90/150 CA. Then there exists p{ 1 < p < 2™ — 2) such that I n ®T n = T/. 



Corollary 1. Let T n be the state transition matrix of an n-cell maximum-length 
90/150 CA. Then there exists k(l < k < 2™ — 2) such that T: * © T/ +1 = I n . 



Example 1. We consider a 4-cell 90/150 CA whose state transition matrix T 4 is 
as the following : 

0 10 0 



0 0 11 

The characteristic polynomial of X 4 is f(x) = x 4 -\- x + 1, which is primitive. 
We obtain T 4 n 0 T 4 12 = I 4 . 

From now on, we will simply write T n by T. 

Theorem 3. Let T be the state transition matrix for a given n-cell maximum- 
length 90/150 CA. Then the ith row of T h and T m are always different, where 
0<z<n-l,l</i<m<2 n -2. 




Theorem 4. Let T be the state transition matrix for a given n-cell maximum- 
length 90/150 CA and let wq / (0, 0, • • • , 0) be the initial configuration ofT. 
Then for any l<i<j<n — 1, there exists an integer h such that q t / rh = q\ 
for all t > 0, where q \ denotes the state of the ith cell at the time of instant t. 
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Let T be the state transition matrix for a given n-cell maximum-length 
90/150 CA and let vjq = (1,0, •••,0) be the initial configuration of T. Then 
we obtain a (2" — 1) x n matrix A consisting of n independent PN sequences 
generated by T as its columns. The sum of some columns of A is also another 
PN sequence ([6]). The number of PN sequences generated by n columns is equal 

to n C\ + n C '2 H 1- n C n = 2 n — 1. Thus we can get a (2" — 1) x (2 ra — 1) matrix 

whose columns consist of all PN sequences generated by A. Such a matrix is 
referred to as matrix M. 

For example, let T =< 0, 1,0, 1 >. Then A and M are given in Table 1. 



Table 1. Matrices A and M for T=<0,1,0,1> 





A 


M 


0 


1000 


100011100011101 


1 


0100 


010010011011011 


2 


1110 


111000101110001 


3 


1111 


111100000011110 


4 


1100 


110001111000110 


5 


1010 


101010110101010 


6 


0001 


000100101101111 


7 


0011 


001101111011000 


8 


0110 


011011001101100 


9 


1011 


101110011000101 


10 


0010 


001001010110111 


11 


0101 


010110110110100 


12 


1101 


110101010101001 


13 


1001 


100111001110010 


14 


0111 


011111100000011 



For two configurations v and w of T, we define the row phase shift k of v 
with respect to w such that T k v = w. In Table 1 the row phase shift of v with 
respect to w is 12, where w = (1, 1, 0, 0) and v = (0, 0, 1, 1). 

The relative phase shift of one column of M with respect to the other is 
specified in the following theorems. The PN sequence generated by some cell 
positions of the CA C is where a* is the dependency of g*. 

For example, let ao = a± = 1 and <22 = 0, then Y2i=o a^g,; = go©<?i- Therefore 
go © gi = {go © Qi} = 1100101 • • • is the 3rd column of M. 

Theorem 5. Let T be the state transition matrix of an n-cell maximum-length. 
90/150 CA and let w = a i w i anc ^ v = Sj=o bj w j> where w\ is the trans- 

i 

pose of Wi = (0, 0, • • • , 0, 1, 0, • • • , 0) (0 < * < n— 1). Then there exists an integer 
r such that 



T r w*®T r+1 w t = v t . 
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Proof. We can find (3 such that = u*. By Theorem 2 there exists p such 

that (/„ ® T) = T p . Let r = f3 — p ( mod 2” — 1). This completes the proof. 

Theorem 5 says that —{3 is the row phase shift of v with respect to w of A. 

The following theorem gives a method to compute phase shifts. 

Theorem 6. Let T be the state transition matrix of an n-cell maximum-length 
90/150 CA C. Let s and u be given two columns of M, where s = X^=o 
and u = Y^j = o bjQj- If h is the phase shift of u with respect to s, then h = — (r + 
p) ( mod 2” — 1), where r and p are in Theorem 5 and Theorem 2, respectively. 

Proof. Let A be the (2™ — 1) x n matrix consisting of n independent PN sequences 
generated by T as its columns and let (1, 0, - - - , 0) be the initial configuration 
of T. Let w = £r=o cLiWi and v = bj w j- Since T is symmetric, the phase 

shift of u with respect to s is equal to the row phase shift of v with respect to 
w. Therefore h = — (r + p) ( mod 2” — 1). 

Example 2. Let C be the given CA with T =< 0,1,0, 1 > and (1,0,0, 0) the 
initial configuration of T. Then we obtain matrices A and M as Table 1. Since 
the characteristic polynomial of T is x 4 + x + 1, p = 4 in Theorem 2. Let s = 
(1,1, 0,0, 0,1, 0,0, 1,1, 0,1, 0,1,1)* and u = (0, 0, 1, 0, 0, 1, 1, 0, 1, 0, 1, 1, 1, 1, 0)* 
in M. If we put w = (1, 1, 0, 0) and v = (0, 0, 1, 1), then by Theorem 5 r = 14. 
Hence h = —(14 + 4) = 12 (mod 15). The phase shift of u (which is the sum of 
the 2nd cell position and the 3rd cell position of C) with respect to s (which is 
the sum of the 0th cell and the 1st cell position of (D) is 12. 

Let s be the sequence combined the 0th column with the 1st column and u 
the sequence combined the ( n — 2)th column with the (n — l)th column of the 
matrix A obtained by the given CA rule and the initial vector (1,0, •••,0). 
Phase shifts of the sequence u with respect to the sequence s are given in 
Table 2. 

4 Algorithm to Compute Phase Shifts 

Now we give an algorithm to find the phase shifts in a given n-cell maximum- 
length 90/150 CA. 



Algorithm FindPhaseShifts 

Input: The state transition n x n matrix T, initial vector w = (1, 0, • • • , 0). 
Output: phaseshift[n]. 

Step 1. marki xn by 0; mark [0]=1; 

power=l; plraseshift[0]=0. 

Step 2. While (all mark / 1) do step 3 to step 5. 

Step 3. w* = Tw*. /* Run the CA */ 
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Table 2. Phase shifts of the sequence u with respect to the sequence s 



Degree 


CA rule 


Phase shift 


3 


110 


3 


4 


0101 


12 


5 


01111 


8 


6 


000110 


23 


7 


1011001 


75 


8 


01001011 


9 


9 


010011100 


44 


10 


1111000011 


994 


11 


01000011010 


1426 


12 


100101010011 


3882 


13 


0111001110110 


7649 


14 


01000111001111 


14568 


15 


100000011000001 


9538 


16 


0001111001001000 


63960 


17 


10011000110011001 


77544 


18 


110001000000010011 


241877 


19 


1101011101101001011 


282318 


20 


01101011100001010110 


314775 


21 


010010011001010010010 


1676666 


22 


0100011010101101100010 


1345981 


23 


01011010101100101011010 


6095661 


24 


110100111100100111001011 


199263 


25 


1010000011111011100000101 


17370017 


26 


11001101111101111010110011 


66236246 


27 


000110100110001011101011000 


132625967 


28 


0101101110000001100111011010 


195968798 


29 


01001000100101111100100010010 


205911726 


30 


101000100111001101101010000101 


404894385 


31 


1111101101100001100011011011111 


2015461719 


32 


00001100010001110000110000000110 


25871503 



Step 4. If ( w contains single 1) 
then mark [position of 1]=1; 

phaseshift [position of 1 ] = — power (mod 2" — 1). 
Step 5. power=power+l. 



There does not need the Shank’s algorithm for the completion of this algo- 
rithm any more. The algorithm that we propose does not need any previous 
phase shifts. Whereas, it is required to compute all previous in Sarkar’s method 
([10]) which adopted the Shank’s algorithm in order to get the phase shifts of 
the ith column with respect to the 0th cell. So, we can see that this is very 
practically useful for sufficiently large n. 
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Table 3. Phase shifts with respect to the Oth cell of the CA whose rule is < 
00001100010001110000110000000110 > 



Degree 


CA rule 


Phase shifts 


32 


0000110001000111 


0, 4294967294, 3963262907, 


4294967292 




0000110000000110 


2182065471, 


3478064023, 


2842396500 






3797410740, 


2636154424, 


1477132997 






2453647807, 


3833928247, 


4122644326 






2445882768, 


3715941894, 


3131603603 






3781145264, 


724531189, 


1964528637 






1178642835, 


1437488410, 


2132417369 






2228497937, 


2438002527, 


3823282243 






3142683718, 


4037203264, 


3657430022 






496625232, 


3264387886, 


25871502 






25871503 







In Table 3 there are all phase shifts with respect to the Oth cell of the CA 
whose rule is < 00001100010001110000110000000110 >. 

5 Conclusion 

In this paper, we studied the properties of sequences generated by maximum- 
length 90/150 CA sequences and applied these results to the phase shifting of 
PN sequences generated by it. From these applications we proposed a method 
for the study of phase shift analysis. As a special case of this method, we gave 
an algorithm to compute relative phase shifts between stages of a CA. This 
algorithm is different from the algorithms of Bardell’s [1], Nandi and Chaudhuri’s 
[9], and Sarkar’s [10]. Our algorithm is more practical than them. 
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Abstract. This paper reports design of a pattern classifying machine 
(PCM) for distributed data mining (DDM) environment. The proposed 
PCM is based on the computing model of a special class of sparse network 
referred to as Cellular Automata (CA). Extensive experimental results 
confirm scalability of the PCM to handle distributed datasets. The ex- 
cellent classification accuracy and low memory overhead figure establish 
the proposed PCM as the classifier ideally suited for DDM environments. 

1 Introduction 

The inter-networked society has been experiencing an explosion of data in many 
distributed environments. However, the explosion is paradoxically acting as an 
impediment in acquiring knowledge. The meaningful interpretation of different 
distributed sources of voluminous data is increasingly becoming difficult. Conse- 
quently, researchers, practitioners, entrepreneurs from diverse fields are focusing 
on development of sophisticated techniques for knowledge extraction, which leads 
to the promising field of distributed data mining (DDM) [1,2,3]. 

Due to high response time, lack of proper use of distributed resources, and 
its inherent characteristics, the conventional centralized data mining algorithms 
are not suitable for DDM environment. Most DDM algorithms are designed for 
parallel processing on distributed data. Similar algorithms are applied on each 
distributed data source concurrently, producing one local model per source. All 
local models are next aggregated to produce final model. 

One of the important problems in data mining is classification. Most dis- 
tributed classifiers in DDM work in ensemble learning approach, which produces 
multiple classifiers (base classifiers) at different data sources. Results derived 
out of multiple classifiers are next combined at a central location to enhance 
the classification accuracy of predictive models. Majority voting scheme is ap- 
plied to aggregate base classifiers. Meta-learning framework offers another way 
to aggregate these base classifiers. So, the most important issues in DDM are 
the aggregation of the base classifiers and the communication cost from different 
local data sources to the central location where final classification is performed. 



P.M.A. Sloot, B. Chopard, and A.G. Hoekstra (Eds.): ACRI 2004, LNCS 3305, pp. 40—49, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 
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Data Set S = { Sj , S 2 , Sj^} having K number of distinct classes distributed at N sites 



Trai ning Phase at local sites 



PCM, 




PCM-? 




PCMj 
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(1st Base 




(2nd Base 




(ith Base 




(Nth Base 


Classifier) 




Classifier) 




Classifier) 




Classifier) 



PCMj PCM 2 pcm . PCM n 



HHF = { PCMj } ( i = 1, 2, N ) 

Aggregation — > Majority Voting 

Testing Phase at Central Location 



> Class S; e S 



Note: The set of CA based PCM (Pattern Classifying Machine) 
constitute a Hamming Hash Family (HHF) 



Fig. 1 . Architecture of a PCM for distributed data mining (DDM) environment 



In the above background scenario, the essential prerequisites of designing a 
classifier for DDM environment are: (i) base classifier at each local site must 
have high classification accuracy; (ii) it must be scalable to handle disk resident 
datasets; (iii) it should have high throughput with less storage requirements; (iv) 
volume of data transfered from each local to central site should be minimum; 
(v) aggregation of base classifiers must induce globally meaningful knowledge; 
and (vi) the algorithm should be simple enough to have a low cost hardwired 
implementation for on-line operation in the inter- networked society of cyber age. 

In this paper, we propose the design of a pattern classifying machine (PCM) 
to address the problem of DDM. At each local site, we design a CA based PCM 
as a base classifier. The majority voting scheme is next implemented to aggregate 
the results of the base classifiers and predict the class of the data element. The 
architecture of the majority voting scheme is developed around the CA based 
hamming hash family (HHF) [4]. Each PCM is a member of this HHF. The 
probability of collision between a pair of patterns varies inversely with their 
hamming distance (HD) while hashed with HHF. This property of HHF is used 
to design the majority voting scheme for distributed data mining (DDM). 

Next section presents an overview of CA based pattern classifying machine 
(PCM) for distributed data mining (DDM) environment. 

2 PCM for DDM Environment 

In this section, we present the architecture (Fig. 1) of a pattern classifying ma- 
chine (PCM) for distributed data mining (DDM). Let the data which have to 
be classified into different classes S = (Si, 5*2, • • • , Sk} are distributed among 
N number of different sites. For a given feature/metric of the dataset, a PCM 
is generated as a base classifier (PCM.;/, where i = 1, 2, • • • , N) at each local 
data site. The set of PCMs are next copied at a central location. The majority 
voting scheme is employed to aggregate the results of these PCMs. The prop- 
erty of hamming hash family (HHF) [4] is exploited in majority voting scheme 
which enhances the classification accuracy at central location. Thus the training 
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phase of the classifier is implemented at a local site to design the PCM, while 
the testing phase to identify the class of a data element is implemented in the 
central location. Communication between local site and central location takes 
place only if data at local site gets updated and a new PCM is generated for 
classification of updated dataset. 

The algorithm of classification for distributed data mining (DDM) environ- 
ments proceeds as follows. Let the number of PCMs (base classifiers) be N, 
where each PCMi has classified a dataset into K number of distinct classes 
S = {Si, 1 S 2 , • • • , Sk} available at a local site i. Any incoming data Vi ncom is 
hashed with each of the PCMs in a central location. At each instance of hashing, 
Vincom collides with a class Sj £ S. Each Sj £ S records the count of the num- 
ber of times it collides with Vi nC om ■ Once the process of hashing in all PCMs is 
completed, the class Sj with which Vincom has collided for maximum number of 
times is recorded as the class of Vi ncom . The sequential steps of the algorithm 
follows. 

Algorithm 1 . Identification_of_the_class _in_a JDDM_environment 
Input: An incoming data Vi nC om input to PCMi (i = 1, 2, • • • , N). 

Output: Class Sj of the data classes S = (Si, • ■ ■ , Sk}- 
begin: 

for i = 1 to N 

hash Vincom with PCMi 

pick Sj with which Vincom collides for maximum number of times 
output Sj. 

In rest of the paper we concentrate on the design of hamming hash family 
(HHF) - a set of pattern classifiers which behave as the base classifiers for dif- 
ferent local sites. The classifier design is based on a special class of CA, termed 
as Multiple Attractor CA (MACA) [4,5]. For the sake of easy reference a brief 
introduction of MACA follows next. 

3 Multiple Attractor CA (MACA) 

A Cellular Automaton (CA) consists of a number of cells. In a 3-neiglrborhood 
dependency, the next state (pit + 1) of a cell is assumed to be dependent only 
on itself and on its two neighbors (left and right), and is denoted as 

Qi(t + 1) = /(5i-i(t),9*(t),9i+i(t)) (1) 

where (pit) represents the state of the i th cell at t th instant of time. / is the next 
state function and referred to as the rule of the automata. Since / is a function 
of 3 variables, there are 2 2 i.e. , 256 possible next state functions (rules) for a CA 
cell. Out of 256 rules there are only 7 rules with XOR logic. The CA employing 
XOR rule is referred to as linear CA. The pattern classifier proposed in this 
paper employs a special class of linear CA referred to as MACA [4,5]. 

The state transition graph of an MACA consists of a number of cyclic and 
non-cyclic states. The set of non-cyclic states of an MACA forms inverted trees 
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rooted at the cyclic states. The cyclic states with self loop are referred to as 
attractors. Fig. 2 depicts the state transition diagram of a 5-cell MACA with 
four attractors {00000(0), 00011(3), 00100(4), 00111(7)}. The states of a tree 
rooted at the cyclic state a forms the a-basin. 
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Fig. 2. State transition diagram of a 5-cell MACA. Four cyclic states 0, 3, 4, 7 are 
referred to as attractors and the corresponding inverted tree rooted on a (a = 0, 3, 4, 
7) as a-basin. 



The detailed characterization of MACA is available in [4,5]. A few fundamen- 
tal results for au n-cell MACA having k number of attractor are next outlined. 
Result I: 

Definition 1. An m-bit field, of an n-bit pattern set is said to be pseudo- 
exhaustive if all possible 2 m patterns appear in the set. 

Theorem 1. In an n-cell MACA with k = 2 m attractors, there exists m-bit 
positions at which the attractors generate pseudo-exhaustive 2 m patterns. 

Result II: An n-bit MACA with 2-attractor basins can be represented by 
an n-bit binary string, termed as Dependency Vector (DV). If DV is an n-bit 
Dependency Vector and V is an n-bit pattern, then the modulo-2 sum (XOR) 
of the dependent variables of V (where DV contains l’s) is equal to zero if V 
belongs to zero basin; otherwise 1. That is, 

D y _ f 0, if V £ zero basin . 

} 1, if V £ non-zero basin ' 



Result III: An n-bit MACA with 2 m -attractor basins can be represented 
by an n-bit Dependency String (DS). An n-bit Dependency String DS is pro- 
duced through concatenation of m number of Dependency Vectors of length 
ni, U 2 , ■ ■ ■ , n m respectively (Fig. 3), where ni + U 2 + ■ • • + n m = n and V is an 
n-bit pattern whose attractor basin is to be identified. For each DV., (of length 
nf), the dependent variables of the corresponding ni bits of V (say Vf) gives 
either 0 or 1, - that is, 

nv _ JO, if Vi £ zero basin of DVi 

1 * } 1, if Vi £ non-zero basin of DVi 
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DV> 




DV m -l 


DV m 



n [ n 2 n m - 1 n m 

Fig. 3. An n-bit Dependency String (DS) consists of m number of Dependency Vectors 
(DVs). Each Dependency Vector (DV) contributes the value of each pseudo-exhaustive 
bit (either 0 or 1) of an attractor basin. 



which indicates the value of i th pseudo-exhaustive bit. Finally, a string of m 
binary symbols can be obtained from m number of DVs. This m-bit binary 
string is the pseudo-exlraustive field (PEF) of the attractor basins where the 
pattern V belongs. That is, the pseudo-exhaustive field (PEF) of the attractor 
basin of V is given by 



PEF = DS • V = [DV L • Vi\[DV 2 ■ V 2 \ ■ ■ ■ [ DV m ■ V m \ (3) 

where DS and V - both are n-bit vectors. So, the complexity of identifying the 
PEF of an attractor basin is O(n). This specific result is of significant importance 
for our design. It enables the scheme to identify the class of an input element 
with linear complexity. 

3.1 Hamming Hash Family 

An MACA can act as an effective hash function. The total pool of MACA forms 
a hash family and exhibits an interesting property: the hash family maintains 
an inverse relationship between the collision of a pair of patterns while hashed 
and their hamming distance [4]. The hash family having this property is referred 
to as hamming hash family (HHF). The concept of HHF imparts an important 
property to the attractor basin of MACA. Since keys (patterns) that collide lie 
in the same attractor basin, the keys (patterns) which lie close to each other in 
terms of hamming distance (HD) belong to the same basin. 

3.2 MACA As a Pattern Classifier 

An n-bit MACA with fc-attractor basins can be viewed as a natural classifier. 
It classifies a given set of patterns into k number of distinct classes, each class 
containing the set of states in the attractor basin. To enhance the classification 
accuracy of the machine, most of the works [4,5] have employed MACA to classify 
patterns into two classes (say I and II). The following example illustrates an 
MACA based two class pattern classifier. 

Example 1. Let, the MACA of Fig. 2 be employed to classify patterns into 
two classes (say I and II), where Class I is represented by one set of attractor 
basins (say [I]= {00100 and 00111} in Fig. 2) while Class II is represented by 
the rest of the basins (say [II] = {00000 and 00011}). All the patterns in the 
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attractor basins [I] belong to Class I while rest of the patterns belong to Class 
II. As per Theorem 1, the pseudo-exhaustive field (PEF) will identify the class 
of the patterns uniquely. The PEF yields the address of the memory that stores 
the class information. Therefore, Class I attractors yield the memory address 
{10, 11}, while Class II will be identified by the memory address {00, 01} (Fig. 
4). When the MACA is loaded with an input pattern V, corresponding MACA 
identifies the PEF of the attractor basin where V belongs. 



o o o o o o 
0 6 6 

6 6 6 

PEF (bits) 

PEF (bits) 



PEF ( bits ) 

PEF ( bits ) 

Fig. 4. MACA based classification strategy with 4 attractor basins classifying the el- 
ements into two classes. The PEF (Pseudo Exhaustive Field) of the attractor of a 
basin is extracted to determine the class. Also, the PEF of each attractor points to the 
memory location that stores the class information. 



=> i u n 

, i 10 J * classI 

01 1 =* dass n 

, -n °° - 

MEMORY 



O O 
6 



Genetic Algorithm (GA) formulation to arrive at the desired MACA realiz- 
ing this specific objective has been proposed in [4] with 0(n 3 ) complexity. The 
design reported in the next subsection achieves classification with linear com- 
plexity. To enhance classification accuracy of the machine, we have refined the 
approach reported in [4] and report a new CA based classifier. It can classify an 
n-bit pattern with O(n) complexity. Multi-class classifier is built by recursively 
employing the concept of two class classifier. MACA based classifier proposed 
here has two distinct stages and hence referred to as Two Stage Classifier (Fig. 5). 

3.3 MACA Based Two Stage Classifier 

The design of MACA based classifier for two n-bit pattern sets S i and S 2 should 
ensure that elements of one class (say £1) are covered by a set of attractor basins 
that do not include any member from the class £2- Any two n-bit patterns 
Pi £ £1 and V -2 £ S 2 should fall in different attractor basins. 

Let, an n-bit 2 m -attractor basins MACA can classify two n-bit pattern sets 
£1 and £2. That is, 

DS-P 1 ^DS- V 2 (4) 

where DS is an n-bit Dependency String consisting of m number of Dependency 
Vectors (Fig. 3). Then, the total number of attractor basins will be 2 m and the 
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pseudo-exhaustive field (PEF) (Theorem 1) of each attractor basin will be an 
m-bit binary pattern/string. Let, k\ and k 2 be two ?n-bit pattern sets consist- 
ing of pseudo-exlraustive bits of attractors of two n-bit pattern sets S\ and S 2 
respectively. Then, k-\ and k 2 can also be regarded as two m-bit pattern sets for 
two class classification. So, we synthesize an MACA based two class classifier 
such that one class (say k\) belongs to one attractor basin and another attractor 
basin houses the class k 2 . Any two m-bit patterns p\ € k\ and p 2 £ k 2 should 
fall in different attractor basins (while one in zero basin, another in non-zero 
basin), - that is, 

DV ■ pi # DV ■ p 2 (5) 

where DV is an m-bit Dependency Vector. 




Input Layer 
( n-bit ) 



Classifier #1: n— bit Dependency String consisting of m number of Dependency Vector! 
Classifier #2: m-bit Dependency Vector 



Fig. 5. Two Stage Classifier 



Fig. 5 represents the architecture of Two Stage Classifier. It consists of three 
layers - input, hidden and output layers denoted as Xi (i = 1, 2 , • • • , n), yj ( j = 
1,2, • • • , m), and Ok (k = 1) respectively. While the first classifier (Classifier#l 
of Fig. 5) maps an n-bit pattern of the input layer into an m-bit pattern (PEF) 
of the hidden layer, the second classifier (Classifier#2) maps that m-bit pattern 
into a single bit (either 0 or 1) of the output layer. That is, Classifier# 1 provides 
an appropriate mappings of patterns of input layer into PEF (pseudo-exhaustive 
field) of the hidden layer and the Classifier#2 implements the classification of 
the PEFs rather than the original patterns. 

Let, x be an n-bit input pattern whose class is to be identified by MACA 
based Two Stage Classifier. At first, x is loaded with the Classifier#l which 
outputs an m-bit pattern y (pseudo-exhaustive field of attractor of the basin 
where x belongs). That is, 

y = DS ■ x (6) 

Next, y is loaded with the Classifier#2 which gives a single value o (pseudo- 
exlraustive field of attractor of the basin where y belongs) that determines the 
class of the input pattern x. That is, 

o = DV ■ y (7) 

In order to evolve MACA based Two Stage Classifier realizing this design ob- 
jective, we have developed a special type of GA formulation. Next section reports 
the performance of MACA based Two Stage Classifier in DDM environment. 






Cellular Automata Evolution for Distributed Data Mining 



47 



4 MACA Based PCM for DDM Environment 



The pattern classifying machine (PCM) is designed with a set of Two Stage 
Classifiers. This section provides a detailed analysis of performance evaluation 
of MACA based PCM. The major metric to evaluate the performance of the 
proposed PCM is classification accuracy - the percentage of test data which can 
be correctly classified into different attractor basins. To evaluate classification 
accuracy, we perform the experiments on synthetic dataset proposed in [4] . 

The characterization of MACA basins reported in the Section 3.1 has es- 
tablished following facts: (i) a basin has patterns which are close to each other 
in terms of HD; (ii) patterns satisfying this criterion have high probability of 
getting classified into minimum number of attractor basins; and (iii) patterns 
not covered by the training sets, but satisfying the criteria, will also have high 
probability of being predicted as the right class. 

As a corollary to the above facts, the accuracy of classifying two classes of 
patterns in distinct attractor basins increases if the average distance between 
them is high. In this context, to characterize the datasets to be classified in two 
classes, we introduce two terms D m i n and d max . D m i n is the minimum hamming 
distance (inter-cluster distance) between two patterns V\ £ Si and V 2 £ S 2 in 
two classes Si and S 2 respectively; while d max is the maximum distance (intra- 
cluster distance) between two patterns within a class. The experimental set up 
and dataset employed for analysis have been reported in [4]. For the sake of 
performance analysis, the distributions of patterns in two classes are assumed 
as shown in Fig. 6. 




Fig. 6. Distribution of patterns in Class I and Class II (HD denotes hamming distance) 



4.1 Experimental Setup 

In each of the above distribution, different values of n (number of bits in a pat- 
tern) and N (number of sites) are taken. For each value of n and N, 50% of the 
data are taken to build up the classification model. The rest 50% data are used to 
test the prediction capability of the classifier. For centralized data mining (DM), 




48 



P. Maji, B.K. Sikdar, and P.P. Chaudhuri 



Table 1 . Classification Accuracy for Curve a — a' and b — b' 



No of 


Curve a — a! 


Curve b — b' \ 


Sites 


Size 


Value 


Classification Accuracy 


Size 


Value 


Classification Accuracy 1 


N 


(n) 


of m 


Centralized DM 


DDM 


(n) 


of m 


Centralized DM 


DDM 


10 


200 


4 


96.12 


98.98 


200 


4 


94.49 


97.89 




600 


5 


96.71 


98.71 


400 


5 


93.12 


97.07 




1000 


7 


96.87 


97.89 


1000 


7 


94.67 


96.95 


20 


200 


4 


96.12 


98.99 


200 


4 


94.49 


98.01 




400 


5 


95.19 


98.21 


600 


5 


95.17 


98.39 




1000 


7 


96.87 


98.03 


1000 


7 


94.67 


97.33 


30 


200 


4 


96.12 


99.41 


200 


4 


94.49 


98.35 




800 


7 


97.06 


99.97 


400 


5 


93.12 


97.77 




1000 


7 


96.87 


99.03 


600 


5 


95.17 


98.54 



Table 2. Classification Accuracy for Curve c — c' 



No of 


Size 


Value 


Classification Accuracy 


Sites ( N ) 


(n) 


of m 


Centralized DM 


DDM 


10 


400 


5 


85.33 


98.89 




600 


5 


86.55 


99.07 




1000 


7 


84.79 


98.63 


20 


200 


4 


84.25 


98.79 




600 


5 


86.55 


99.13 




1000 


7 


84.79 


99.11 


30 


200 


4 


84.25 


99.17 




800 


7 


83.94 


98.69 




1000 


7 


84.79 


99.16 



the PCM is designed based on the 50% of the total dataset centrally available. 
For distributed data mining (DDM) environment, total dataset is equally dis- 
tributed to each local site and then same scheme is employed at each local site to 
synthesize the PCMs. In other words, in each site 50% of available data is used 
to train the PCM. The MACA based PCMs are next collected in the centralized 
location where classification accuracy is tested with the remaining set of 50% 
data not used for training. The classification accuracy of the proposed DDM is 
evaluated through the majority voting scheme reported in the Algorithm 1 of 
Section 2. For each values of N and n, 10 different pairs of datasets are taken. 



4.2 Classification Accuracy 

Tables 1-2 represent the classification accuracy of conventional centralized CA 
based classifier and that of PCM in distributed environments for the dataset 
a — a' , b — b', c — d . Columns I and II of Tables 1- 2 represent the number 
of data sites/sources ( N ) and size of the pattern (n) respectively. Column III 
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represents different values of m (number of pseudo-exhaustive bits), where 2 m 
is the number of attractor basins of the MACA. Column IV depicts the classifi- 
cation accuracy in centralized and distributed environments. The experimental 
results confirm the following facts: (i) the accuracy in DDM is better than that of 
conventional centralized MACA based PCMs; (ii) as the number of sites (N) in- 
creases, the classification accuracy for DDM environment significantly increases 
over centralized classification algorithm; and (iii) the concept of hamming hash 
family (HHF) is reported in Section 3.1. The scheme of Majority Voting in cen- 
tral location establishes the elegance and efficiency of the scheme of aggregation 
of classification result of PCMs derived with HHF characterization. 

5 Conclusion 

This paper presents detailed design of an efficient pattern classifier for DDM 
environments. The proposed model is built around a special class of sparse net- 
work referred to as CA. The excellent classification accuracy and low memory 
overhead figures establish the CA as an efficient classifier for DDM environments. 
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Abstract. We present a survey and a comparison of the quantitative 
and qualitative aspects of the optimum-time synchronization algorithms 
developed thus far for one-dimensional cellular arrays. Several new re- 
sults and viewpoints are also given. 



1 Introduction 

Synchronizing a large scale of cellular automata has been known as the firing 
squad synchronization problem since its development, and the problem was orig- 
inally proposed by J . Myhill in order to synchronize all parts of self-reproducing 
cellular automata [10]. The firing squad synchronization problem has been stud- 
ied extensively for more than forty years [1-12, 14-15, 16-19]. In the present 
paper, we firstly examine the state transition rule sets for the famous firing 
squad synchronization algorithms that give a finite-state protocol for synchro- 
nizing large-scale cellular automata. We focus on the fundamental synchroniza- 
tion algorithms operating in optimum steps on one-dimensional cellular arrays 
in which the general is located at one end. The algorithms discussed herein are 
the eight-state Balzer’s algorithm [1], the seven-state Gerken’s algorithm [2], 
the six-state Mazoyer’s algorithm [6], the 16-state Waksman’s algorithm [18] 
and a number of revised versions thereof. Specifically, we attempt to answer the 
following questions: 

— First, are all previously presented transition rule sets correct? 

— Do these sets contain redundant rules? If so, what is the exact rule set? 

— How do the algorithms compare with each other? 

In order to answer these questions, we implement all transition rule sets 
for the synchronization algorithms above mentioned on a computer and check 
whether these rule sets yield successful firing configurations at exactly t = 2n — 2 
steps for any array of length n such that 2 < n < 10000. In addition, we construct 
a survey of current optimum-time synchronization algorithms and compare tran- 
sition rule sets with respect to the number of internal states of each finite state 
automaton, the number of transition rules realizing the synchronization, and 
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the number of state-changes on the array. With the aid of a computer, the first 
Waksman’s transition rule set is shown to include fundamental errors and the 
set is shown to contain a considerable number of redundancies. Approximately 
ninety-three percent of the rules are deleted from the original transition rule 
set. A number of redundant rules that are not used in the synchronization pro- 
cess are also found in other rule sets. We give the smallest rule set for each 
algorithm. Finally, we present herein a survey and a comparison of the quanti- 
tative and qualitative aspects of the optimum-time synchronization algorithms 
developed thus far for one-dimensional cellular arrays. We indicate that the first- 
in-the-world optimum-time synchronization algorithm proposed by Goto [3, 4] 
and reconstructed by Umeo [12] has a state-change complexity of 0(nlogn). 
Several new results and viewpoints are also given. 

2 Firing Squad Synchronization Problem 

2.1 A Brief History of the Developments of Optimum-Time Firing 
Squad Synchronization Algorithms 

A finite one-dimensional cellular array consists of n cells, denoted by C $, where 
1 < i < n. All cells (except the end cells) are identical finite state automata. The 
array operates in lock-step mode such that the next state of each cell (except 
the end cells) is determined by both its own present state and the present states 
of its right and left neighbors. All cells (soldiers), except the left end cell, are 
initially in the quiescent state at time t = 0 and have the property whereby the 
next state of a quiescent cell having quiescent neighbors is the quiescent state. 
At time t = 0 the left end cell (general) is in the fire-when-ready state, which 
is an initiation signal to the array. The firing squad synchronization problem is 
stated as follows. Given an array of n identical cellular automata, including a 
general on the left end which is activated at time t = 0, we want to give the 
description (state set and next-state function) of the automata so that, at some 
future time, all of the cells will simultaneously and, for the first time, enter a 
special firing state. The set of states must be independent of n. Without loss of 
generality, we assume n > 2. The tricky part of the problem is that the same 
kind of soldier having a fixed number of states must be synchronized, regardless 
of the length n of the array. 

The problem known as the firing squad synchronization problem was devised 
in 1957 by J. Myhill, and first appeared in print in a paper by E. F. Moore [10]. 
This problem has been widely circulated, and has attracted much attention. The 
firing squad synchronization problem first arose in connection with the need to 
simultaneously turn on all parts of a self-reproducing machine. The problem was 
first solved by J. McCarthy and M. Minsky who presented a 3n-step algorithm. 
In 1962, the first optimum-time, i.e. (2 n — 2)-step, synchronization algorithm 
was presented by Goto [3], with each cell having several thousands of states. 
Waksman [18] presented a 16-state optimum-time synchronization algorithm. 
Afterward, Balzer [1] and Gerken [2] developed an eight-state algorithm and a 
seven-state synchronization algorithm, respectively, thus decreasing the number 
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of states required for the synchronization. In 1987, Mazoyer [6] developed a six- 
state synchronization algorithm which, at present, is the algorithm having the 
fewest states. 

2.2 Complexity Measure for Optimum-Time Synchronization 
Algorithms 

• Time complexity. Any solution to the firing squad synchronization problem 
can easily be shown to require (2 n — 2) steps for firing n cells, since signals on 
the array can propagate no faster than one cell per step, and the time from the 
general’s instruction until the firing must be at least 2 n — 2. (See Balzer [1], 
Mazoyer [5, 6] and Waksman [18] for a proof.) Thus, we have: 

[Theorem 1.] [1, 5, 6, 18] Synchronization of n cells in less than 2n — 2 steps is 
impossible. 

[Theorem 2.] [1-3, 6, 18] Synchronization of n cells in exactly 2n — 2 steps is 
possible. 

• Number of internal states. The following three distinct states: the quiescent 
state, the general state, and the firing state, are required in order to define any 
cellular automaton that can solve the firing squad synchronization problem. The 
boundary state for Co and C n +i is not generally counted as an internal state. 
Balzer [1] showed that no four-state optimum-time solution exists. In addition, 
there exists no five-state optimum-time solution satisfying the special conditions 
that Balzer [1] studied. The question that remains is: “What is the minimum 
number of states for an optimum-time solution of the problem?” At present, that 
number is five or six. 

[Theorem 3.] [1] There is no four-state CA that can synchronize n cells. 
[Theorem 4.] [1] There is no five-state solution satisfying Balzer’s special con- 
ditions. 

• Number of transition rules. Any fc-state (excluding the boundary state) 
transition table for the synchronization has at most ( k — 1 )fc 2 entries in ( k — 1) 
matrices of size k x k. The number of transition rules reflects the complexity of 
synchronization algorithms . 

3 Transition Rule Sets for Optimum-Time Firing Squad 
Synchronization Algorithms 

3.1 Waksman’s 16-State Algorithm 

In 1966, Waksman [18] proposed a 16-state firing squad synchronization algo- 
rithm, which, together with an unpublished algorithm by Goto [3], is referred 
to as the first-in-the- world optimum-time synchronization algorithm. Waksman 
presented a set of transition rules described in terms of a state transition table. 
To describe the state transition table in a concise way, Waksman introduced a 
special symbol 7 that matches any state in its right and/or left neighbor. Since 
Waksman’s table cannot be used as is on a computer, we first expand the ta- 
ble automatically according to the notation given by Waksman [18]. We thereby 
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Fig. 1 . USN transition table consisting of 202 rules that realize Waksman’s synchro- 
nization algorithm. 



obtain a transition table that consists of 3208 rules. We implement these rules 
and examine the validity of the expanded table on a computer. The experiment 
revealed the following: 

[Observation 3.1.] For any n such that 2 < n < 2000, the expanded rule set 
yields successful firings only in the cases of n = 3, 5 or 6. 

Thus, the firing process fails in most cases. 



3.2 USN Transition Rule Set 

Umeo, Sogabe and Nomura [15] corrected all errors in Waksman’s original tran- 
sition table. After exploring the Waksman’s original transition rule set so that 
the resultant table yields correct brings for any n > 2, we get a new transition 
table consisting of is 3217 rules. 

[Observation 3.2.] For any n such that 2 < n < 10000, the rule set consisting 
of 3217 rules yields successful optimum-time firings. 

Our computer simulation based on the above list revealed that most of the 
rules are not used efficiently in the firing process. A histogram-like statistical 
analysis on the rule set is presented herein and the number of rules valid in the 
range of 2 < n < 10000 is reduced. Figure 1 is our final, complete list, which 
consists of 202 transition rules. We refer to this list as the USN transition rule 
set. 

In our correction, a ninety-three percent reduction in the number of transition 
rules is realized compared to Waksman’s original list. The computer simulation 
based on the table of Fig. 1 gives the following observation. Computer simulation 
shows that 202 rules is the smallest set for the Waksman’s optimum-time firing 
squad synchronization. 

[Observation 3.3.] The set of rules given in Fig. 1 is the smallest transition 
rule set for Waksman’s optimum-time firing squad synchronization algorithm. 
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Fig. 2. Transition table for Balzer’s algorithm. 







Right 


State 






\ 




light 


State 




/ l>N 




■ 


UJ- 


/i>im 




T 


# 






/ 








/ 




/ 








! 


\ 






[ 






ST 
























i 


\ 








* 








> 












1 




3> 


" 










1 




















■ 






c/l 




/ 


\ 


\ 




1 


















/ 








I 




/ 
















1 


/ 










/ 








I 


















# 




















# 








ZDE 




1 



1 


Right State 


/ 


>\i l\KIM# 


1 Left State | 


/ 


















1 


l 




l 


























/ 


/ 




























I 

# 


_L 






l 










1 









[ 


Right State 


/ l>N l\l<l U# 


1 Left State 




[ 




f 




[ 




[ 


■ 
















! 


> 




> 




> 




> 








i 




i 




i 


- 


\ 




i 




i 




i 


















# 

















Fig. 3. Transition table for the seven-state Gerken’s algorithm. 



3.3 Balzer’s Eight-State Algorithm 

Balzer [1] constructed an eight-state, 182-rule synchronization algorithm and 
the structure of which is completely identical to that of Waksman [18]. Our 
computer examination revealed no errors, however, 17 rules were found to be 
redundant. In Fig. 2, we give a list of transition rules for Balzer’s algorithm. 
Deleted rules are indicated by shaded squares, where squares containing letters 
denote rules included in the original list. In the transition table, the symbols 
“M”, “L”, “F” and “X” represent the general, quiescent, firing and boundary 
states, respectively. 



3.4 Gerken’s Seven-State Algorithm I 

Gerken [2] reduced the number of states realizing Balzer’s algorithm and con- 
structed a seven-state, 118-rule synchronization algorithm. In our computer ex- 
amination, no errors were found, however, 13 rules were found to be redundant. 
In Fig. 3, we give a list of the transition rules for Gerken’s algorithm. The 13 
deleted rules are marked by shaded squares in the table. The symbols “>”, 
“/”, “...” and represent the general, quiescent, firing and boundary states, 
respectively. 



3.5 Mazoyer’s Six-State Algorithm 

Mazoyer [6] proposed a six-state, 120-rule synchronization algorithm, the struc- 
ture of which differs greatly from the previous algorithms discussed above. Our 
computer examination revealed no errors and only one redundant rule. In Fig. 4, 
we give a list of transition rules for Mazoyer’s algorithm. In the transition table, 
the letters “G”, “L”, “F” and “X” represent the general, quiescent, firing and 
boundary states, respectively. 
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Fig. 4. Transition table for the six-state Mazoyer’s algorithm. 



3.6 Goto’s Algorithm 

The first synchronization algorithm presented by Goto [3] was not published as 
a journal paper. According to Goto, the original note [3] is now unavailable, and 
the only existing material that treats the algorithm is Goto [4] . The Goto’s study 
presents one figure (Fig. 3.8 in Goto [4]) demonstrating how the algorithm works 
on 13 cells with a very short description in Japanese. Umeo [12] reconstructed 
the algorithm of Goto based on this figure. Mazoyer [8] also reconstructed this 
algorithm again based on the presentation given by Umeo [12]. The algorithm 
that Umeo [12] reconstructed is a non-recursive algorithm consisting of a mark- 
ing phase and a 3n-step synchronization phase. In the first phase, by printing 
a special marker in the cellular space, the entire cellular space is divided into 
subspaces, the lengths of which increase exponentially with a common ratio of 
two, that is 2- 7 , for any integer j such that 1 < j < [log 2 n\ — 1. The marking 
is made from both the left and right ends. In the second phase, each subspace 
is synchronized using a well-known conventional (3n + 0(l))-step simple syn- 
chronization algorithm. A time-space diagram of the reconstructed algorithm is 
shown in Fig. 5. 




Fig. 5. Time-space diagram for Goto’s Fig. 6. A comparison of state-change 
algorithm as reconstructed by Umeo complexities in optimum-time synchro- 
112]. nization algorithms. 
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3.7 Gerken’s 155-State Algorithm II 

Gerken [2] constructed two kinds of optimum-time synchronization algorithms. 
One seven-state algorithm has been discussed in the previous subsection, and 
the other is a 155-state algorithm. The transition table given in Gerken [2] is 
described in terms of two-layer construction with 32 states and 347 rules. The 
table does not operate as is on a single-layer simulator. In order to obtain an 
exact transition rule set, we expand this transition table into a single-layer format 
and obtain a 155-state table consisting of 2371 rules. 



4 State Change Complexity 

Vollmar [17] introduced a state-change complexity in order to measure the ef- 
ficiency of cellular algorithms and showed that Q (n log n) state changes are re- 
quired for the synchronization of n cells in (2 n — 2) steps. 

[Theorem 5.] [17] fl(nlog?r) state-change is necessary for synchronizing n cells 
in (2 n — 2) steps. 

[Theorem 6.] Each optimum-time synchronization algorithm developed by 
Balzer [1], Gerken [2], Mazoyer [6] and Waksman [18] has an 0(n 2 ) state-change 
complexity, respectively. 

[Theorem 7.] [2] Gerken’s synchronization algorithm II has a 0(nlogn) state- 
change complexity. 

Let S(n) be total number of state changes for Yunes-like synchronization 
algorithms [9, 19] on n cells. We see that S(n) = an + 2S(n/2) = O(nlogn). 
Thus we have: 

[Theorem 8.] Each 3n— step synchronization algorithm developed by Minsky 
and MacCartlry [9] and Yunes [19] have a ©(nlogn) state-change complexity, 
respectively. 

Based on [Theorem 8] and our description given in subsection 3.6, we can 
establish the following theorem. 

[Theorem 9.] Goto’s synchronization algorithm as reconstructed by Umeo [12] 
has a ©(nlogn) state-change complexity. 

Figure 6 shows a comparison between the state-change complexities in 
optimum-time synchronization algorithms. 



5 A Comparison of Quantitative Aspects of 
Optimum-Time Synchronization Algorithms 

Here, we present a table based on a quantitative comparison of optimum-time 
synchronization algorithms and their transition tables discussed above with re- 
spect to the number of internal states of each finite state automaton, the number 
of transition rules realizing the synchronization, and the number of state-changes 
on the array. 
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Table 1. Quantitative comparison of transition rule sets for optimum-time firing squad 
synchronization algorithms. The symbol in parenthesis shows the correction and 
reduction of transition rules made in this paper. The “**” symbol indicates the number 
of states and rules obtained after the expansion of the original two-layer construction. 



Algorithm 


# of states 


# of transition rules 


State change complexity 


Goto [1962] 


many thousands 




B(n log n) 


Waksman [1966] 


16 


3216(202*) 


0(nQ 


Balzer [1967] 


8 


182 (165 *) 


oW) 


Gerken I [1987] 


7 


118 (105*) 


OOQ 


Mazoyer [1987] 


6 


120 (119*) 


Ob?) 


Gerken II [1987] 


32(155**) 


347(2371**) 


B(n log n) 



6 One-Sided Versus Two-Sided Recursive Algorithms 

Firing squad synchronization algorithms have been designed on the basis of par- 
allel divide-and-conquer strategy that calls itself recursively in parallel. Those 
recursive calls are implemented by generating many Generals that are respon- 
sible for synchronizing divided small areas in the cellular space. In Fig. 7, we 
show two recursive synchronization schemes. If all of the recursive calls for the 
synchronization are issued by Generals located at one (both two) end(s) of par- 
titioned cellular spaces for which the General is responsible, the synchronization 
algorithm is said to have one-sided ( two-sided ) recursive property. As for those 
algorithms discussed, we have the following observation: 

[Observation 6.1] Optimum-time synchronization algorithms developed by 
Balzer [1], Gerken [2] and Waksman [18] are two-sided ones. The algorithm 
proposed by Mazoyer [6] is a one-sided one. 

Now we propose a general design scheme for one-sided recursive optimum- 
time synchronization algorithms that can synchronize any n cells in 2n — 2 steps. 
Figure 8 is a time-space diagram for an optimum-time one-sided recursive syn- 
chronization algorithm. The General Go generates an infinite number of signals 
u>o,wi,W 2 , to generate Generals Gi, G 2 , .., by dividing the array recursively 
with the ratio x/y, where x, y is any positive integer such that 2x < y. The 
generated i-th General G;(i > 1) does the same operations as Go does. When 
x = 1,2/ = 2, the scheme coincides with the Mazoyer’s algorithm [6]. It is noted 
that, in this case, we need no delay for the generation of Generals. We can 
establish the following theorem. 

[Theorem 10.] The one-sided recursive scheme given above can synchronize 
any n cells in 2n — 2 optimum steps. 

7 Recursive Versus Non-recursive Algorithms 

As is shown in the previous section, the optimum-time synchronization algo- 
rithms developed by Balzer [2], Gerken [3], Mazoyer [6] and Waksman [18] are 
recursive ones. On the other hand, it is noted that overall structure of the recon- 
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2*2 



Fig. 7. One-sided recursive synchroniza- 
tion scheme (left) and two-sided recursive 
synchronization scheme (right). 



Fig. 8. Time-space diagram for 
optimum-time one-sided recursive 
synchronization algorithm. 



structed Goto’s algorithm [12] is a non-recursive one, however divided subspaces 
are synchronized by using recursive 3n + 0(l)-step synchronization algorithms. 

8 Finite Versus Infinite Number of Signals 

Waksman [18] devised an efficient way to cause a cell to generate infinite signals 
at propagating speeds of 1/1, 1/3, 1/7, .., l/(2 fc — 1), where k is any natural 
number. These signals play an important role in dividing the array into two, 
four, eight, . . . , equal parts synchronously. The same set of signals is used in 
Balzer [2]. Gerken [3] had a similar idea in the construction of his seven-state 
algorithm. Thus an infinite set of signals is used in the first three algorithms. On 
the other hand, finite sets of signals with propagating speed {1/5, 1/2, 1/1} and 
{1/3, 1/2, 3/5, 1/1} are made use of in Gerken [2] and the reconstructed Goto’s 
algorithm [12], respectively. 

9 A Comparison of Qualitative Aspects of Optimum-Time 
Synchronization Algorithms 

Here, we present a table based on a qualitative comparison of optimum-time 
synchronization algorithms with respect to one/two-sided recursive properties 
and the number of signals being used for simultaneous space divisions. 



10 0(1)-Bit Versus 1-Bit Communication CA Model 

In the study of cellular automata, the amount of bit-information exchanged 
at one step between neighboring cells has been assumed to be 0(l)-bit. A 1- 
bit inter-cell communication model is a CA in which inter-cell communication is 
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Table 2. A qualitative comparison of optimum-time firing squad synchronization al- 
gorithms. 



Algorithm 


One- / two-sided 


Recursive/non-recursive 


# of signals 


Goto [1962] 


— 


non-recursive 


finite 


Waksman [1966] 


two-sided 


recursive 


infinite 


Balzer [1967] 


two-sided 


recursive 


infinite 


Gerken I [1987] 


two-sided 


recursive 


infinite 


Mazoyer [1987] 


one-sided 


recursive 


infinite 


Gerken II [1987] 


two-sided 


recursive 


finite 



restricted to 1-bit data, referred to as the 1-bit CA model. The next state of each 
cell is determined by the present state of that cell and two binary 1-bit inputs 
from its left and right neighbor cells. A precise definition of the 1-bit CA can 
be found in Umeo and Kamikawa [13]. Mazoyer [7] and Nishimura, Sogabe and 
Umeo [11] each designed an optimum-time synchronization algorithm on a 1-bit 
CA model, based on Balzer’s algorithm and Waksman’s algorithm, respectively. 
[Theorem 11.] [7, 11, 14] There exists a 1-bit CA that can synchronize n cells 
in optimum 2n — 2 steps. 

11 Summary 

Cellular automata researchers have reported that several errors are included in 
Waksman’s transition table. However, the validity of the transition rule sets 
designed thus far has never been confirmed. This was one of our motivations 
that we started our study. In the present paper, we have examined via computer 
the state transition rule sets for which optimum-time synchronization algorithms 
have been designed over the past forty years. The first transition rule set designed 
by Waksman [18] includes fundamental errors that cause unsuccessful firings and 
ninety-three percent of the rules are redundant. In addition, the transition rule 
sets given by Balzer [1], Gerken [2] and Mazoyer [6] also include redundant rules. 
The authors think that it is worthy of publishing such smallest transition rule sets 
for the famous firing squad synchronization algorithms, and they are useful and 
important for researchers who might have interests in those transition rule sets 
that realize the classical optimum-time firing algorithms quoted frequently in the 
literatures. We have presented a survey and a comparison of the quantitative and 
qualitative aspects of the optimum-time synchronization algorithms developed 
thus far for one-dimensional cellular arrays. 
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Abstract. Decentralized peer to peer (p2p) networks like Gnutella are 
so successful because they are robust and require no centralized direc- 
tories and no precise control over network topology. In this paper, we 
develop an efficient search algorithm for p2p networks with the help of 
a 2-dimensional Cellular Automaton (CA) model. The rules followed by 
each individual cell of the CA are inspired by concepts of natural immune 
systems whereby the query message packets in the network are spread 
through opportunistic proliferation. Through a series of experiments, we 
compare proliferation with different variants of random walk algorithms. 
The detailed experimental results show message packets undergoing pro- 
liferation, spread much faster in the network and consequently produce 
better search output in p2p networks. Moreover, experimental results 
show that proliferation rules are extremely scalable and their perfor- 
mance is largely insensitive to the change in dimension of the CA grid. 



1 Introduction 

Significant research efforts have recently been undertaken in the development 
of peer-to-peer (p2p) computing systems. Due to their flexibility, reliability and 
adaptivity, p2p solutions can overcome a lot of disadvantages of client-server 
systems, and can fit to the dynamically changing Internet environment [7]. This 
explains the large popularity of file sharing systems like Gnutella [2], Napster 
[1], and Freenet [3]. 

Among different desirable qualities of a search algorithm, robustness is a very 
important property. That is, the performance of a search algorithm should not 
radically deteriorate in face of the dynamically changing network. As is known, 
the big share of Internet users, consequently participants in p2p networks, leave 
the community {p2p network) at very short intervals. Thus in order to give 
robustness a high priority, algorithms generally avoid precise routing algorithms 
for forwarding query message packets. However, flooding-based query algorithms 
used by the networks produce enormous amounts of traffic which substantially 
slows down the system. Recently flooding is increasingly replaced with different 
variants of more efficient fc-random walkers 1 algorithms [5]. 

* This work was partially supported by the Future & Emerging Technologies unit of 
the European Commission through Project BISON (IST-2001-38923). 

1 A system where there are k message packets each performing a random walk, inde- 
pendently. 



P.M.A. Sloot, B. Chopard, and A.G. Hoekstra (Eds.): ACRI 2004, LNCS 3305, pp. 61—70, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 
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The goal of this paper is to study more-efficient alternatives to the existing 
fc-random walkers algorithms, which form the base algorithm for a large class of 
p2p networks. In this respect, we draw our inspiration from the natural immune 
system. Our algorithm has been inspired by the simple and well known concept of 
the humoral immune system where B cells upon stimulation by foreign antigens 
undergo proliferation generating antibodies. The increasing number of antibodies 
consequently efficiently track down the antigens. 

In this paper, we develop a 2-dimensional Cellular Automaton model to sim- 
ulate the p2p environment. Performance by different random walk and prolifer- 
ation techniques is evaluated with the help of the 2-dimensional C A. Hence in 
the next section, we detail the CA model used to abstract the p2p environment. 
The experimental results are noted in Section 3. 



2 Modeling and Evaluation Methodology 

In this section, we elaborate the two-dimensional CA model. Both state defi- 
nition of each cell of the CA as well as the rules guiding the state transition 
behavior of the cells are defined. The transition rules can be divided into two 
parts, rules which are common to both proliferation and random walk techniques 
and rules which are unique to one of these techniques. In Section 2.1 , we describe 
the features common to all the processes, whereas in Section 2.2, we explain the 
unique rules. 

2.1 CA Model 

It is impossible to model all the dynamics of a p2p system. In this paper, we 
are not attempting to resolve minor quantitative disparities between fc-random 
walk and proliferation algorithms, but instead are trying to reveal fundamental 
qualitative differences. While our simple models do not consider all aspects of 
reality, we hope they capture the essential features needed to understand the 
qualitative differences. 

Each cell of the two-dimensional {in x n) CA represents a peer/node in 
a p2p system. Each peer in the network (consequently cells in the CA) can 
communicate with four neighbors. The peers (cells) host some data which are 
searched by other peers (cells) during a search operation. For simplification, we 
have assumed that there are 1024 different types of data present in the network, 
each of which can be represented by a 10-bit string. Hence, each cell hosts a 
10-bit data string (say) P which we also refer to as token. 

The search operation is performed with the help of three rules - Query Ini- 
tiation Ride (QIR), Query Processing Rule (QPR) and Query Forwarding Rule 
(QFR). Next these rules are explained one by one. 

Query Initiation Rule (QIR): A query is initiated by a randomly selected 
peer which requests for some data (say M). To obtain an answer to the request, 
the peer floods message packets (M) into its neighboring peers. In order to 
receive these message packets, each cell of the CA maintains a message queue 
MQ. The length of the message queue is considered as unbounded. Like the 
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tokens, there are 1024 unique query messages (M). The Query Initiation Rule 
is 

Rule 1. QIR: Form Message Packet M /*Pick an element M from Query Set Q*/ 
Flood k message packets(M ) into the neighbors. 

Data and Query Distribution: The 1024 unique tokens (both data and query) 
are distributed according to Zipf’s law. It is generally believed that the distribu- 
tion of documents in the Internet follows Zipf ’s law [6] . In Zipf’s distribution the 
frequency of occurrence of some event (here token/message) t, as a function of 
the rank i, where the rank is determined by the above frequency of occurrence, 
follows a power-law ti oc In the experimental setup, the Zipf’s exponent 
(value of a) for both the data and query distribution is 1. The ranking in terms 
of frequency is the same, for both tokens and messages — for instance, the most 
popular token in the network is also the most popular query. 

Query Processing Rule (QPR): Once the search is initiated, the messages 
hop from cell to cell in subsequent time steps. Whenever, a cell encounters a 
message packet, it checks whether the message has earlier visited this cell or 
not. In this connection, each cell maintains a field named V. If the message has 
not previously visited the cell, then the message is compared with P. A successful 
search is reported if P = M. A field SS which initially is set to 0 everywhere is 
updated to 1 in the respective cell to indicate a successful search. 

Rule 2. QPR: If (Message packet(M)) Start. 

V++; 

if ((V == 1) AND (P = M)), /*Report. a match, V = 1 indicates first 
SS = 1 time visit by a message*/ 

Once the queries are processed, the messages have to be forwarded to the 
neighboring cells through a query forwarding rule. Each of the proliferation and 
random walk techniques has a unique query forwarding rule which will be defined 
in the next section. A summary of the common features is presented. 

CA Model - Summary: The states, rules, neighborhood configuration and 
updation synchrony of each cell of the two dimensional CA is next summarized. 

A. States: Token P: A 10-bit string which doesn’t change over time. 

Message Queue(MQ): A queue of messages of unbounded length. 

Visit CountfV ): A counter to track the number of messages visiting the cell. 
Success Indicator(SS ): A one bit switch which is set to one if there is a successful 
search. MQ, SS and V are initialized to 0 at the beginning of each search. 

B. Rules: Query Initiation Rule (QIR): During every search operation it is 
executed once only by a cell randomly selected to initiate the search. 

Query Processing Ride (QPR) & Query Forwarding Ride (QFR): These rules 
are executed by all the cells at each time step. QFR differs from process to 
process, which will be discussed in Section 2.2. 

C. Neighborhood: Each cell has four neighbors. The rules are periodic in 
nature, that is, in effect, the CA is a two-dimensional toroidal grid. 

D. Update Synchrony: The CA is updated asynchronously, the sequence in 
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which cells get updated is random. The asynchronous update mimics the real- 
life environment, where peers in the p2p network update their states without 
maintaining any synchrony with the neighboring peers. 

2.2 Query Forwarding Rule (QFR) 

We here describe two proliferation- based and two random walk based QFRs. 
Proliferation ( V ): In the proliferation scheme, packets undergo proliferation at 
each cell they visit. The proliferation is guided by a special function, whereby a 
message packet visiting a node proliferates to form N new message packets which 
are thereby forwarded to the neighbors of the node. 

Rule 3. V: If (Message packet(M)) Start 

Produce N new message packets(M ) 

Spread the N new packets to N new randomly selected neighbors 

of A 

The function determining the value of ‘ N new ’ ensures that N new < 4, where 4 
is the number of neighbors of any cell. The function is discussed in detail after we 
elaborate the proliferation rule which imposes restrictions in packet movement. 
Restricted Proliferation ( TIP ): The rule HP, similar to T, produces N new 
messages. But these N new messages are forwarded only if the node A has > 
Nnew free neighbors. By ‘free’, we mean that the respective cells haven’t been 
previously visited by message M. If A has Z ‘free’ neighbors, where Z < N new , 
then only Z messages are forwarded, while the rest is discarded. However, if Z 
= 0, then one message is forwarded to a randomly selected neighbor. 



Rule 4. HP: If (Message packet(M)), Start- 
Produce N new message packets (M). 

Z = No of ‘free’ neighbors 
if (Z > N new ) 

Spread the packets to N new randomly selected neighbors of A 
else if (Z > 0) 

Spread Z packets to Z free neighbors of A 
Discard the remaining (N new - Z) packets 
else 

Forward one message packet to a randomly selected neighbor of A 
Discard the remaining (N new - 1) packets 

We now elaborate the function used to control the amount of proliferation. 
Proliferation Controlling Function: We choose the proliferation of message 
packets at any cell A to be heavily dependent on the similarity between the 
message packet (M) and the token (P) of A. This is in line with the affinity 
based proliferation mechanism followed by immune systems. In this regard we 
define p, where p = e~ HD x p represents the proliferation constant, it is 
identical for all nodes; HD is the Hamming distance 2 between the message (M) 

2 The number of bits which differ between two binary strings 
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and the information profile (P). With the help of the expression p, we define 
P(ifj - the probability of producing at least 77 packages during proliferation by 
the following equation. 

p(v) = T,U 

The significance of the above equation is (a) .the number of packets proliferating 
is > 1 and < 4; (b). when HD = 0, for p = 4, 4 packets always proliferate; (c). 
the probability of proliferation decreases exponentially if 77 > p\ (d). for HD > 
0 , the probability of proliferation of 77 packets decreases exponentially for 77 > 1 . 

We now describe fc-random walk and a restricted version of it. 
k-random walk (1ZW): In fc-random walk, the cell forwards the message packet 
to a randomly selected neighbors. The rule (PIT) is represented as 



Rule 5. 1ZW : If (Message packet(M)) Start. 

Send the packet M to a randomly chosen neighbor peer 

Restricted random walk (HEW) : In IZRW , instead of passing the message 
( M ) to any random neighbor, we pass on the message to any randomly selected 
‘free’ neighbor. However, if there is no ‘free’ neighbor, we then pass on the 
message to any randomly selected neighbor. 



Rule 6. IZRW : If (Message packetfM )) start. 

Send the packet. M to a randomly chosen ‘ free ’ neighbor peer 
If no ‘free ’ neighbor 

Send the packet M to a randomly chosen peer 

The next section describes various metrics used to compare performances of the 
algorithms. 

2.3 Metrics 

In this paper we focus on efficiency aspects of the processes solely, and use the 
following simple metrics in our abstract p2p networks. These metrics, though 
simple, reflect the fundamental properties of the different query processing rules. 
Success rate: The number of similar items (say /C) found by the query messages 
within a given time period, that is AH = SSi, where i represents each cell 

and N is the total number of cells. 

Coverage rate: The amount of time required by the messages to cover a percent- 
age of the network. The network is fully covered when the visit field - V\ > 1 in 
each cell of the CA. 

Cost, per search output.: The number of messages required to output a successful 
search. In some sense, it is the inverse of the success rate, but it also provides 
different insights into the system, which we will discuss in the next section. 

Bad visits (BV ): The number of times the same message re-visits the same cell. 
If a message packet visits the same cell more than once, it amounts to wastage 
of that packet. BV = X^o,(i/V 7 >i) (^» ‘ !)• So, a good design should minimize 
the amount of bad visits. 
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3 Simulation Results 

The experimental results compare efficiency of different query processing rules 
(rule 3-6). The efficiency comparison is carried out with respect to the metrics 
defined in section 2.3. Beside assessing efficiency, we also perform different seal- 
ability tests with the QFRs. In order to assess the efficiency of different QFRs, 
we have also to ensure fairness of ‘power’ among them which is elaborated next. 



3.1 Fairness in Power 



To ensure fair comparison among all the QFRs , we must ensure that each QFR 
(V, 7 ZP, TZW , TZRW) should participate in the network with the same ‘power’. 
To provide fairness in ‘power’ between a proliferation QFR (say V) and a random 
QFR (say RW), we ensure that the total number of query packets used is 
roughly the same in both cases. Query packets determine the cost of the search; 
too many packets cause network clogging bringing down the efficiency of the 
system as a whole. It can be seen that the number of packets increases in the 
proliferation algorithms over the generations, while it remains constant in the 
case of random walk algorithms. Therefore the number of message packets - k 
in Rule 1 is set in a way so that the aggregate number of packets used by each 
individual algorithm is roughly the same, that is, for TZW, h-RW = t °ot^tep[v] ’ 
where Tot.Pack[P ] is the total number of message packets used by V to perform 
a particular experiment, while TotStep(P\ indicates the total number of time 
steps required to perform that task. 

To ensure fairness in ‘power’ between two proliferation rules (' P & 7 ZP), we 
keep the proliferation constant p and the value of k the same for both processes. 
The value of k is generally set as k = 4, where each cell has 4 neighbors. 

We are now reporting the major experimental results one by one. 



3.2 Experimental Result — Network Coverage 

To evaluate the time taken by the message packets to visit all the cells of the CA 
using different QFRs, we consider a 100 x 100 two-dimensional CA comprising 
of 10000 cells and perform the following coverage experiment. 

Coverage experiment: In this experiment, upon initiation of a search ( Rule 1) 
from a randomly selected cell, the CA is allowed to run till the message packets 
cover the entire network (all cells of CA 3 ). The experiment is repeated 500 times 
on randomly selected initial cells. During the experiment, we collect different 
statistic at every 10% of coverage of the network. 

Fig. 1 plots statistic collected through coverage experiments. Fig. 1(a) shows 
the network coverage rate of the 7 ZP, V algorithm at p = 3, as well as TZRW 
and TZW algorithms. The graph plots the % of network covered (rr-axis) vs. the 
time taken to cover corresponding % of network (y-axis - semilog scale). From 
the figure, it is seen that the time taken to cover the network is much shorter in 
7 ZP followed by V, TZRW and TZW. 

3 The term ‘network’ and ‘CA’ are used interchangeably throughout the section. 
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Fig. 1. Graphs plotting network coverage time, average message used, and number of 
bad visits by V, TZP, TZW and 7 ZRW (semilog scale). 



Fig. 1(b) plots the average number of packets used by the different QFRs 
(y- axis) vs. network coverage (tr-axis). As expected, the number of packets in- 
crease in TZP /P over the period of network coverage while it remains constant for 
random walk {TZW / TZRW ) schemes. Therefore, while performing coverage ex- 
periments, the value of k for TZRW /TZW is set to the average number of message 
packets used by TIP to cover 100% of the network. Although the proliferation 
constant is the same for V and TIP, from the figure it is seen that V produces 
enormous amount of messages, almost 10 times more than TZP. This happens 
because P indiscriminately proliferates. The tendency is further detailed through 
the next figure {Fig. 1(c)). 

Fig. 1(c) shows the number of bad visits (defined in Sec. 2.3) by TZP, V, 7 ZRW 
and TZWs {y- axis) vs. network coverage (tr-axis). It is seen that execution of both 
TZW and P results in a huge amount of bad visits due to their indiscriminate 
behavior. However, even though both TZRW and TZP generally inherently try to 
avoid already visited nodes, we find that TZP can avoid such visits much more 
efficiently. (The number of bad visits by TZRW is 10 times higher than TZP.) 
Therefore, we can conclude that TZP by far outperforms all other techniques in 
terms of network coverage. 



3.3 Experimental Results Search Efficiency 

To compare the search efficiency of TZP & TZRW, we perform the time-step 
experiment on the 100 x 100 two-dimensional CA. 

Time-step experiment: In the experiment, after initiation of a search {Ride 1), 
the CA is allowed to run for A f (= 50) time steps. The search is initiated by a 
randomly selected node and the number of similar items (n s ) found within 50 
time steps from the start of the search is calculated. The search output {n s ) is 
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Fig. 2. Graphs showing search efficiency and cost incurred by 7 ZP and 7 ZRW . 



averaged over one generation (100 different searches), whereby we obtain N s , 
where N s = n a /10Q. 

The graph of Fig. 2(a) displays the average value N s against generation 
number for 7 ZP and URW . The search results for both 7 ZP and URW show 
fluctuations. The fluctuations occur due to the difference in the availability of 
the searched items selected at each generation. However, on average, the search 
efficiency of UP is almost 5-times higher than that of URW . (For UP, the 
number of hits ss 65, while it is ss 13 for URW.) The fluctuations in the results 
help us to understand an important aspect about cost which is discussed next. 

Fig. 2(b) displays the cost /search item (the number of messages required to 
produce a search output) each scheme incurs to generate the performance of Fig. 
2(a). The cost of UP is hardly changing (it stays constant at around 0.7) even 
though the corresponding search output is differing hugely, while in URW there 
is significant fluctuation in terms of cost. This can be easily understood from 
the fact that URW always starts with the same number of packets irrespective 
of the availability of the items. Therefore, when the search output is low, the 
cost shoots up sharply. While in UP, the packets are not generated blindly, but 
are instead regulated by the availability of the searched item. Therefore, if a 
particular searched item is sparse in the network, UP produces a lower number 
of packets and vice versa. 



3.4 Experimental Result Scalability 

In this section, we perform two different types of scalability tests. In the first 
type, we change the shape of the two-dimensional CA so far used, from the 
square CA (100 x 100) to a more rectangular shape, and observe the impact 
it creates on the efficiency of UP & URW. In the second case, we increase the 
dimension of the CA and observe the outcome. 
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a. Network coverage time taken by TZP b. Network Coverage time taken by 7 ZP 

and TZRW with different shape of CA, and TZRW at different CA size, 

each CA having 10000 cells. 



Fig. 3. Graphs illustrating scalability of 1ZP and TZRW under different conditions. 



Shape and Scalability: In this experiment, we consider different two dimen- 
sional CAs each having 10000 cells. The dimensions of such CAs are respectively 
100 x 100, 200 x 50, 400 x 25, 500 x 20, 1000 x 10. Testing the efficiency in all 
such configurations is particularly important because in real p2p environments, 
we may not be able to develop exactly square connection among the peers and it 
is necessary for a proposed algorithm to perform well irrespective of the nature 
of connection layout. We perform the coverage experiments on these differently 
shaped CA; the experimental results are illustrated in Fig. 3(a). 

Fig. 3(a) plots the network coverage time ( y -axis) taken by 7 ZP and TZRW 
against different configuration ( x -axis). For each, 7 ZP and TZRW , we plot the 
time taken to cover all the cells of the CA. The figure shows that for both 7 ZP 
and TZRW , as the shape becomes more rectangular, the network coverage time 
increases. For example, 7 ZP takes 198 time steps to cover the network when the 
CA is square, whereas it takes 1951 time steps when the CA has the dimension 
1000 x 10. Whereas in case of TZRW , the respective figures rise from 1105 to 
31025. That is, 7 ZP has a five-fold rise in network coverage time, while for TZRW 
it increases by a factor of (around) 30. 

Size and Scalability: To measure the performance of 7 ZP and TZRW with 
respect to CAs of different sizes, we consider CAs of dimension 100 x 100, 300 
x 300 and 500 x 500. We perform both time- step and coverage experiments with 
7 ZP and TZRW on the three CAs. In case of the time-step experiment, a scalable 
algorithm should ensure that the number of search outputs does not deteriorate 
with the increase in dimension. And this is indeed confirmed by our experiments 
for both 7 ZP and TZRW (data not shown) . 

The more important test of scalability is elaborated through the next result 
derived by executing a coverage experiment and is depicted in Fig. 3(b). Fig. 3(b) 
plots the time ( y -axis) taken by 7 ZP and TZRW to cover 100% of the network 
against different configuration (x -axis) . The coverage time in 7 ZP increases from 
198 (for 100 x 100) to 586 (for 500 x 500). The rate of increase is less than the 
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logarithm of the rate of increase in the number of nodes. Any search algorithm 
in p2p network with such low rate of increase is considered to be extremely 
scalable [4]. However, for 1ZRW, the network coverage time increases from 1105 
to around 16161, increasing almost linearly with the number of nodes. 

4 Conclusion 

One of the most important functionalities of p2p networks is search. In this pa- 
per, we have concentrated on developing efficient search algorithms for p2p net- 
works. We have produced detailed experimental results showing that the simple 
immune-inspired concept of proliferation can be used to cover the network more 
effectively than random walk. Moreover, the search efficiency of a rule based 
on proliferation (7 ZP) is five times higher than that of restricted random walk 
(' 1ZRW ) algorithms. The proliferation algorithm is extremely scalable. There 
is only minor deterioration in network coverage time with respect to changing 
shape and size. The effectivity of opportunistic proliferation rules we believe is 
a fundamental result and can be applied beyond the domain of the proposed 
p2p search application. However, a detailed theoretical analysis to explain these 
interesting results has to be undertaken in the recent future to explore the full 
potential of the proliferation algorithm. 
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Abstract. This paper exploits the fact that an asynchronous cellular 
automata naturally provides a randomized algorithm. We study the pos- 
sibility to repeat many runs over the same problem instance to improve 
the quality of the answer. We consider the case of the so-called density 
task and quantify the interest of the approach. In addition we show that 
almost 100% of success can be achieved provided that the density task 
is allowed to classify the configuration in three rather than two classes. 



1 Introduction 

Cellular Automata (CAs) are discrete dynamical systems that have been used for 
simulating physical, chemical, social, and biological systems that are difficult or 
impossible to model using differential equations or other standard mathematical 
methods [3]. Usually, one assumes that the cells of a CA are formally updated 
simultaneously. However, perfect synclrronicity is only an abstraction: if CAs 
are to model physical or biological situations or are to be considered physically 
embodied computing machines then the synclrronicity assumption is no longer 
adequate. In fact, in any spatially extended system signals can only travel at 
finite speeds. Hence, for given dimensions it is impossible for a signal emitted 
by a global clock to reach any two computing elements at exactly the same 
time. In biological and sociological environments agents act at different, and 
possibly uncorrelated, times which seems to preclude a global clock in many 
cases. In previous work, we have shown that one can evolve asynchronous CAs 
for computation, in particular for the density task [10]. Although asynchronous 
CAs proved to be less efficient than their synchronous counterparts, they have 
the important advantage of being more robust in noisy environments. 

A randomized algorithm is a computation whose evolution is driven by ran- 
dom choices. It produces a different output each time it is executed, even if the 
input data is the same [7,8]. Repeating several time a randomized algorithm on 
the same problem instance reduces and bounds the possibility of errors. In many 
cases, randomized algorithms are known to be more efficient than their deter- 
ministic counterparts. Here, we apply the principles of randomized computation 
to improve the performance of an asynchronous CA for the density task. 
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The paper is organized as follows. Section 2 briefly describes synchronous 
and asynchronous CAs. Section 3 describes the density task. In section 4 we 
describe our randomized computation. Section 5 reports on the experiments and 
the results. Finally, section 6 presents our conclusions. 

2 Synchronous and Asynchronous Cellular Automata 

CAs are dynamical systems in which space and time are discrete. A standard 
CA consists of an array of cells, each of which can be in one of a finite number 
of possible states, updated synchronously in discrete time steps, according to a 
local, identical transition rule. Here we will only consider boolean automata for 
which the cellular state s £ {0, 1}. The transition rule contained in each cell is 
specified in the form of a rule table, with an entry for every possible neighborhood 
configuration of states. The state of a cell at the next time step is determined by 
the current states of a surrounding neighborhood of cells. The regular cellular 
array (grid) is d-dimensional, where d = 1,2,3 is used in practice. For one- 
dimensional grids, a cell is connected to r local neighbors (cells) on either side 
where r is referred to as the radius (thus, each cell has 2r+l neighbors, including 
itself). The term configuration refers to an assignment of ones and zeros to all 
the cells at a given time step t. It can be described by s 4 = (s s^, . . . , s t N _ 1 ), 
where N is the lattice size. The CAs used here are linear with periodic boundary 
conditions s)y +j; = s 4 i.e., they are rings. 

To visualize the behavior of a CA one can use a two-dimensional space-time 
diagram, where the horizontal axis depicts the configuration at a certain time t 
and the vertical axis depicts successive time steps (for example, see figure 1). In 
the figures state 1 is black, and state 0 is white. 

As noted before, asynchronous CA are physically more reasonable than par- 
allel ones. However, there are many ways for sequentially updating the cells of 
a CA (for a discussion of this point, see [9,10]). The most general and unbiased 
way is independent random ordering of updates in time, which corresponds to 
a Poisson stochastic process. We use a close approximation to it which con- 
sists in randomly choosing the cell to be updated next, with replacement. This 
corresponds to a binomial distribution for the update probability, of which the 
Poisson distribution is the limiting case for large N. 

It should be noted that because our chosen asynchronous updating is nonde- 
terministic, the same CA may reach a different configuration after n time steps 
on the same initial distribution of states, which is of course not the case for 
synchronous CA. This is because the trajectory in configuration space that is 
followed depends on the evaluation order of the cells for asynchronous CA, while 
there is a single possible sequence of configurations for a synchronous CA for a 
given initial configuration of states (see, e.g. figure 1). 

3 The Cellular Automata Density Problem 

The density task is a prototypical distributed computational problem for CAs. 
For a finite CA of size N it is defined as follows. Let p° be the fraction of Is in 
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the initial configuration (IC) s°. The task is to determine whether p° is greater 
than or less than 1/2. In this version, the problem is also known as the majority 
problem. If p° > 1/2 then the CA must relax to a fixed-point configuration of all 
l’s that we indicate as (1)^; otherwise it must relax to a fixed-point configuration 
of all 0’s, noted (0)^, after a number of time steps of the order of the grid size 
N (if N is odd one avoids the case p° = 0.5). 

This computation is trivial for a computer having a central control. Indeed, 
just scanning the array and adding up the number of, say, 1 bits will provide the 
answer in O(N) time. However, it is nontrivial for a small radius one-dimensional 
CA since such a CA can only transfer information at finite speed relying on local 
information exclusively, while density is a global property of the configuration 
of states [6] . 

It has been shown that the density task cannot be solved perfectly by a 
uniform, two-state CA with finite radius [4] , although a slightly modified version 
of the task can be shown to admit perfect solution by such an automaton [2] . The 
performance P of a given rule on the majority task is defined as the fraction 
of correct classifications over 10 4 randomly chosen ICs. The ICs are sampled 
according to a binomial distribution (i.e. , each bit is independently drawn with 
probability 1 /2 of being 0) . Clearly, this distribution is strongly peaked around 
p° = 1/2 and thus it makes a difficult case for the CA to solve. 

But the lack of a perfect solution does not prevent one from searching for 
imperfect solutions of as good a quality as possible. In general, given a desired 
global behavior for a CA, it is extremely difficult to infer the local CA rule 
that will give rise to the emergence of a desired computation. Since exhaus- 
tive evaluation of all possible rules is out of the question except for elementary 
(d = l,r = 1) automata, one possible solution of the problem consists in using 
evolutionary algorithms, as first proposed by Mitchell et al. [5,6]. 

In a recent work [10], we evolved asynchronous CAs for the density task with 
a genetic algorithm (see 1). Compared with synchronous CAs for the same task, 
we found that the asynchronous version is slightly less efficient but it has better 
fault-tolerance capabilities, which is an important feature in noisy environments. 
Figure 1 shows two executions of the evolved asynchronous CA on the same 
initial configuration. Its performance on the density task is P = 67.2%. In this 
work, we do not deal with robustness due to the update mode of the CA, but 
rather we try to enhance the performance of the asynchronous CA by using a 
randomized computation scheme. 



4 Randomized Algorithms 

4.1 General Concepts 

Randomized algorithms have become popular in computer science since, in many 
cases, a randomized algorithm is simpler and faster than a deterministic algo- 
rithm [1,7]. For instance, randomized algorithms exist that run in polynomial 
time, while no polynomial-time deterministic algorithm exist for the problem, 




74 



B. Chopard and M. Tomassini 




(a) (b) 



Fig. 1 . Space-time diagrams of an asynchronous CA for the density task. The density 
p° = 0.55 in both (a) and (b) and the initial state configuration is the same. The 
different time evolution is due to the nondeterminism of the updating policy. 



like primality testing [8]. But there is a price to pay for that: a randomized al- 
gorithm performs random choices during some of its steps and thus, if executed 
several times, it does not exhibits the same behavior. 

The so-called Las- Vegas randomized algorithms are characterized by a com- 
pletion time that varies from one run to another [1]. Sometimes, a Las- Vegas 
run may not finish. It this case, a second run, with the same problem instance 
may well finish in an acceptable time. 

In the so-called Monte-Carlo randomized algorithms, given an instance of a 
problem, the value of the solution should be regarded as a random variable [1]. 
Randomized Monte Carlo algorithms cannot guarantee that false answers will be 
avoided. Therefore it is important to estimate the probability p s that, on a given 
instance s, the algorithm yields a correct answer. If p s > 1/2, the probability 
of an error can be brought down to any desired finite level, provided that the 
experiment is repeated a sufficient number of times. This property is known 
as stochastic amplification and can be understood as follows. If we call p the 
probability of computing the correct result in a run, then the probability that 
the algorithm gives the correct answer on the input i times in k runs is binomially 
distributed, since the runs are independent Bernoulli random variables: 

Prob(k,i ) = p l (1 — p) k ~ l 
Thus, the probability of answering correctly is: 

Prob({n = r) > k/2) = ^ ( k ^j p l (1 - p) k ~ l 

i—k/2+1 ' ' 





Randomized Computation with Cellular Automata 



75 



By using a normal approximation to the binomial distribution, one finds for 
example that, for k = 42 runs and with p = 0.672, i.e. the performance of 
the rule, the probability of getting at least i = 22 correct answers, and thus of 
correctly classifying the configuration, is about 0.98. Note that we used p = 0.672 
in this example because it corresponds to the performance of one run of our 
asynchronous CA sampled over 10 4 different initial configuration. However, this 
value is not the average probability of success of the algorithm over the sampled 
configuration. 
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Fig. 2. Histogram of the probabilities that a configuration of given density p° is evolved 
by the asynchronous CA to the fixed point (1)^. with all ones. The probability range 
has been divided in 20 bins. The number of configurations in each bin is normalized 
by the total number of configurations (1000). System size is N = 149 and the number 
of repetition is N r = 100. 



4.2 The Random Behavior of the Asynchronous CA 

As described previously, on configuration s°, the asynchronous CA reaches one 
of the two fixed points (1)^ or (0)^ in a number t oo of site updates. If the 
algorithm is run only once, the output is p° > 1/2 if configuration (1)^ is reached 
and p° < 1/2 if the final state is (0) w . Obviously, this is not always the correct 
answer regarding to the actual density of s°. Unfortunately, the probability p s o 
that this procedure gives the right answer is not known and varies very much 
from one configuration to the other. 
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Note also that the time too needed to reach one of these two possible fixed 
points also depends on the updating path. Thus, the asynchronous CA for the 
density task is a randomized algorithm that behaves as a combination of a Monte- 
Carlo and Las- Vegas algorithm. 

A natural way to improve the performance of the algorithm is to repeat it 
several time. If one considers N r repetitions of the asynchronous CA on a chosen 
configuration s°, we shall observe that in N_> 1 cases, the CA has evolved to (l) w 
whereas, in N r — N_ tl other cases, the final state is the null configuration (0)^ . 

Therefore, we define /V , as the probability that configuration s° converges 
towards (1)^. If the number N r of repetitions of the algorithm is large, this 
probability can be estimated by 



P -( S °) = §f W 

This probability of converging to configuration 1 is quite dependent of the 
problem instance s°. However, we expect that P_ tl will be small if p° is small, 
and large in the opposite case. But, for two initial configurations of same density, 
the corresponding P_ >1 can be quite different, as shown in fig. 2. 

Figure 2 shows the histogram of the values of for 1000 configurations 
chosen randomly with various prescribed densities p. Practically speaking, these 
configurations are generated by a random permutation of the cells of a configu- 
ration containing the desired number of ones and zeros. The situation depicted 
in fig. 2 concerns a CA of size N = 149. For each configurations, N r = 100 
repetitions of the asynchronous CA runs are performed. The situation of fig. 2 
corresponds to densities p° = 0.35, p° = 0.4, p° = 0.45, p° = 0.55, p° = 0.6 and 
p° = 0.65. The case of p° = 0.5 will be shown later on. 

In order to evaluate the statistical significance of the curves obtained for 100 
repetitions and 1000 configurations, each experiment has been conducted twice, 
i.e for two sets of 1000 random configurations. We see that, for these parameters, 
the algorithm gives quite reproducible results. 



5 Classification Schemes 

The purpose of the present asynchronous CA is to perform the density task, that 
is to classify each configuration s in different classes, according to the predicted 
value of p°. In this section, we show how N r repetitions of the CA evolution can 
be used for this task. We shall first consider a straightforward extension of the 
standard case N r = 1 and show that the quality of the classification improves 
beyond that obtained with the synchronous CA [5] provided that N r is large 
enough. 

Then, we shall propose a new way to classify the configuration, using three 
classes instead of two. We show that this methods lead to an almost perfect 
classification scheme. 




Randomized Computation with Cellular Automata 



77 



5.1 Classification in Two Classes 

The natural way to exploit the result of our randomized algorithm is to base the 
classification process on the value of P_> 1 . The following pseudocode describes 
the randomized algorithm we have used to solve the density task: 

— choose an initial configuration s°; set = 0 

— do N r independent runs of the CA on s° and 

for each run, increment N r if the final configuration is (l) w 

— Compute P_ >1 = N^ 1 /N r 

— if P_ tl >1/2 then predict p° > 1/2; 
else predict p° < 1/2; 

Following the standard procedure to evaluate the quality of a CA algorithm 
to perform the density task [5] , we consider 10 4 random configurations for which 
each cell is initialized to 1 with probability 1/2. The performance a of the al- 
gorithm is then the fraction of these configurations that have been correctly 
classified. 

We first give the performance a of our Monte-Carlo CA algorithm as a func- 
tion of the number of repetition N r . The result is shown in fig. 3 (a). We observe, 
as expected, that for N r = 1, we obtain the result of [10], namely a ~ 0.67. 

Note that we take N r as an odd number so that P^ 1 is never one-half. As 
N r increases, a gets larger, showing the stochastic amplification of repeating a 
Monte-Carlo algorithm several time. However, we observe a saturation of a. 

The work done to improve the quality of the prediction increases as the num- 
ber of repetition N r increases. Here the work is defined as the average number 
of site updates that are needed in total to classify one of the 10 4 configurations. 
It is thus an indication of the average completion time of the algorithm over 
many instances. This work is proportional to N r but, it is observed that the 
proportionality constant increases with the system size N. Table 1 summarizes 
our measurement of the work. 



Table 1 . Average normalized work for one run of the asynchronous and synchronous 
CA. The total work is normalized by the system size. Thus the given figures corresponds 
to the number of so-called Monte-Carlo steps (asynchronous case) or CA iterations 
(synchronous case). Data are obtained from an average over 10 4 configurations. 
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In fig. 3 (b) we compare, as a function of the lattice size N, the work of both 
the synchronous (1 run 1 ) and asynchronous CA (N r runs) to obtain the same 

1 The synchronous CA cannot be repeated since it is deterministic. However random 
errors could be added to produce a randomized algorithm. This possibility has not 
been explored here 
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nx=149, 10000 configurations 
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Fig. 3. (a) The performance of our Monte-Carlo CA algorithm for the density task, as 
a function of the number of its repetition on each instance. The lattice size is N = 149, 
and 10 4 configurations have been generated at random. The dashed line indicates the 
performance achieved with one run of synchronous updating, namely 0.77. (b) Work 
normalized to system size to achieve same performance with the asynchronous and 
synchronous scheme. 



classification performance. We have considered the sizes N = 149, N = 200 and 
N = 500 for which the performances of the synchronous CA are 0.77, 0.75 and 
0.73, respectively. 

We see that the ratio work/performance of the synchronous CA is quite good 
compared to that of the asynchronous one. However, the synchronous CA has a 
maximum performance that cannot be improved, as opposed to the asynchronous 
one. In addition, it is argued that a VLSI implementation of an asynchronous 
CA would run faster than the synchronous version 2 . When N = 200, a factor 
of 4 in the hardware speed of the implementation would be necessary to make 
the randomized asynchronous CA faster than its deterministic counterpart while 
keeping the same classification performance. 



5.2 Classification in Three Classes 

The previous scheme can be improved by using three classes instead of two. 
Clearly, the configurations whose density is difficult to classify as larger or smaller 
than one-lralf are those configurations with a density close to one-half. These con- 
figurations turn out to also have a spread around 0.5. This is well illustrated 
in fig. 4 (a) where we show the histogram of P^ 1 for densities p° = 0.4, p° = 0.5 
and p° = 0.6. 

2 A. Stauffer, private communication 
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nx=149, 10000 configurations 




Fig. 4. (a) histogram of P_ tl for densities p° = 0.4, p° = 0.5 and p° = 0.6. The 
two dashed lines gives the histograms when, respectively, 1000 and lO’OOO random 
configurations are visited (each bit generated with probability 1/2). (b) Score of the 
Monte-Carlo CA algorithm for the density task when the classification is made in 
three classes p < 0.5, p > 0.5 and 0.4 < p < 0.6. The number of repetitions is 
N r = 1,3,5,10,20, 100. 



Figure 4 (a) suggests a new classification scheme. The rule we can extract is 
the following, for any configuration s. 

- If P^ < 0.3 then p° < 0.5 

- If P^ > 0.75 then p° > 0.5 

- If P^ > 0.3 and P_^ < 0.75 then 0.4 < p° < 0.5 

Note there is a slight asymmetry in the chosen threshold 0.3 and 0.75. These 
values are those suggested by figure 4 (a) but this issue has not been further 
investigated here. 

It is important to stress that the three regions defined by the above conditions 
{p < 0.5, p > 0.5, 0.4 < p < 0.6 overlap as a configuration can be at the same 
time in the region 0.4 < p < 0.6 and p < 0.5 (or p > 0.5). However, the algorithm 
will classify each configuration in only one region. The information we get from 
these three classes is different from that given with the standard approach but 
not less. The pertinence of this classification depends the actual reason why the 
density task is performed. 

Note that a classification in 3 regions is only possible with the randomized 
algorithm which gives an output P_ yl with more than two values. 

The quality of this scheme is extremely good: over 95%, even for N r = 10. 
Figure 4 (b) shows the performance we obtain as a function of N r , for a system 
of size N = 149. It is possible that the thresholds used to define the three regions 
depend on the lattice size. This question has not been investigated yet. 
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Finally, the thresholds 0.3 and 0.75 on P_ +1 can be adjusted to narrow the 
middle region (0.4 < p° < 0.6). However, this will decrease the score of the 
algorithm. More simulations would be required in order to obtain the relation 
between these threshold values and the algorithm performance. 

6 Conclusions 

In this paper we showed that an asynchronous CA for a global task can be 
naturally turned into a randomized, Monte-Carlo algorithm. It gives a different 
answer each time it is executed, even if it is on the same input data. By repeating 
it several times we can improve the performance of the classification of initial 
configuration with respect to their initial densities. The ratio work/performance 
of this approach is compared with that of the traditional synchronous approach. 

We then propose a new way to perform the classification of the initial density 
p°, using three classes: p° < 1/2, p° > 1/2 and 0.4 < p° < 0.6. This possibility 
is offered by the random nature of the algorithm and yields more than 95% 
performance for a moderate cost. 

Clearly, the combination of asynchronous CA and randomized algorithms 
offers new perspectives in the framework of global task computations. 
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Abstract. Modeling free-form shapes in 3D spaces based on strict physical 
laws require a considerable amount of computation time. Previous experiences 
with Cellular Automata demonstrated substantial improvements for these appli- 
cations. Here, we describe an efficient approach applying the Cell-DEVS for- 
malism to model the deformation of free-form shape objects in a 3D space. The 
objects are treated as virtual clay, and the values assigned to each cell in the 
model represent a certain amount of clay in the cell. The clay object is de- 
formed in accordance with the physical conservation laws. 



1 Introduction 

3D modeling techniques have become increasingly attractive, ranging from CAD 
systems up to image morphing in computer graphics. The representation of solid 3D 
objects requires to use restrictive geometrical operations. Different studies on the 
simulation of such deformations for objects use strict physical models, such as finite 
element methods, methods based on elasticity theory, and applications of particle 
systems. All these methods and applications need considerable time for computing 
deformations according to the laws, and human interactions are not permitted espe- 
cially for complex shapes. 

Instead, imagining the 3D objects as clay that can be freely deformed, we can un- 
derstand problems on 3D objects. Some of the undergoing efforts considering volume 
sculpting in a 3D virtual space use a discretization of the space in 2D or 3D cells. In 
[1], a solution based on Cellular Automata (CA) was presented. A 3D CA is used to 
simulate plastic deformations of clay, and each cell is allocated a finite state automa- 
ton, which is given the simple distribution rules of the virtual clay instead of compli- 
cated physical laws. Each automaton repeats state transitions according to the state of 
their neighbors. This approach tries to avoid the considerable computation time for 
plastic deformation and make the deformation process natural by showing the be- 
havior of real clay. An extension of this work, presented in [2], includes new reparti- 
tion algorithms. 

We will show how to model a 3D free-form object using Cell-DEVS [3], and we 
will simulate deformation using the CD++ toolkit [4]. Cell-DEVS is a novel approach 
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to represent models of real systems as cell spaces. Cell-DEVS uses the DEVS (Dis- 
crete Events systems Specifications) formalism [5] to define a cell space where each 
cell is defined as a DEVS model. This technique permits to build discrete-event cell 
spaces, and it improves their definition by making the timing specification more ex- 
pressive. The Cell-DEVS presented here describes the behavior of each cell in a 3D 
free-form virtual clay model using the state transition rules presented in [1], This 
model describes effectively the behavior of free-form object: compression (from 
outside) and deformation (from inside). 



2 Background 

Cell-DEVS [3] is a combination of CA and DEVS [5] that allows the implementation 
of cellular models with timing delays. A Cell-DEVS model is defined as a lattice of 
cells holding a state variable and a computing apparatus to update the cell state. This 
is done using the present cell state and a set of inputs coming from cells in the neigh- 
borhood. Cell-DEVS improves execution performance of cellular models by using a 
discrete-event approach. It also enhances the cell’s timing definition by making it 
more expressive [6]. Each cell, defined as TDC=< X, Y, S, N, delay, d, d INT S En , T, A, 
D >, uses N inputs to compute its next state. These inputs, which are received through 
the model's interface (X, Y), activate the local computing function (x). State (s) 
changes can be transmitted to other models, but only after the consumption of a delay 
(d). Two kinds of delays can be defined: transport delays model a variable commut- 
ing time, and inertial delays, which have preemptive semantics (scheduled events can 
be discarded). Once the cell behavior is defined, a coupled Cell-DEVS is created by 
putting together a number of cells interconnected by a neighborhood relationship. 



Cell's connections 




Fig. 1 . Description of a Cell-DEVS model. 

A coupled Cell-DEVS is composed of an array of atomic cells, with given size and 
dimensions, defined as GCC = < Xlist, Ylist, X, Y, n, { t tj , N , C, B, Z>. Each cell 
is connected to its neighborhood through standard DEVS input/output ports. Border 
cells have a different behavior due to their particular locations. Finally, the model’s 
external couplings can be defined in the Xlist and Ylist. Each cell in a Cell-DEVS is a 
DEVS atomic model, and the cell space is a DEVS coupled model. DEVS is an in- 
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creasingly accepted modeling and simulation framework. DEVS is a sound formalism 
based on generic dynamic systems, including well defined coupling of components, 
and hierarchical modular construction. A DEVS model is described as a composite of 
submodels, each of them being behavioral (atomic) or structural (coupled). Each 
atomic model, defined by AM = < X Y, S, S exl , S im , A, ta>, has an interface consisting 
of input (X) and output (Y) ports to communicate with other models. Every state (S) 
in the model is associated with a time advance ( ta ) function, which determines the 
duration of the state. Once this time is consumed, an internal transition is triggered. 
At that moment, the model generates results through the output ports by activating an 
output function (A). Then, an internal transition function (S inl ) is fired, producing a 
state change. Input external events, which are received in the input ports, activate the 
external transition function (8 ext ). Coupled models are defined as a set of basic com- 
ponents (atomic or coupled), which are interconnected through the model's interfaces. 
The model’s coupling defines how to convert the outputs of a model into inputs for 
the others, and to inputs/outputs to the exterior of the model. 

CD++ [4] is a modeling tool that was defined using DEVS and Cell-DEVS specifi- 
cations. DEVS Atomic models can be incorporated onto a class hierarchy pro- 
grammed in C++. Coupled models can be defined using a built-in specification lan- 
guage, in which the model is specified by including the size and dimension of the cell 
space, the shape of the neighborhood and borders. The cell’s local computing func- 
tion is defined using a set of rules with the form: POSTCONDITION DELAY { 
PRECONDITION } . These indicate that when the PRECONDITION is satisfied, the 
state of the cell will change to the designated POSTCONDITION, whose computed 
value will be transmitted to other components after consuming the DELAY. 



3 A Virtual Clay Model 

In virtual clay models, a 3D object deformation is considered as a physical process 
that equally distributes the virtual clay to the adjacent areas. A threshold is associated 
with the deformation of the object [1]: when the density is under the threshold, the 
object keeps its shape. If a portion receives an external force, its density changes; if 
the density is above the threshold, the object is deformed, and clay is transported from 
high-density to low-density portions. However, the total mass of the clay should be 
conserved. We model the object as a 3D Cell-DEVS. A positive value assigned to 
each cell represents the mass of clay in the piece. In [ 1], the authors define the model 
using a Margolus neighborhood, and the following rules for each block: 

[Step A] For each cell i whose state is 1, 
dm. = nr * a; 
m j = m j - dm.; 

[Step B] For each cell j whose state is 0, 
m = m + ( (dm I + dm 2 +. . .+ dm) / n ); 

where a is a constant rate for distribution ( 0 < a < 1 ), t is the number of cells 
over threshold and n is the number of cells under threshold. Here, we denote the state 
of a cell as 1 if its virtual clay is over the threshold. Otherwise, the state is 0. The 
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value dm. represents the excessive clay in cell i, which will be distributed to the 
neighboring cells. From these two steps, we can see that the total mass of virtual clay 
within a block is conserved during the state transitions. Figure 3 illustrates the transi- 
tion rules in 2D. 





Fig. 2. 2D block patterns. 



Moving plate 




| Cell over threshold 
A push operation Q ^ mder 



■=> 



(a) Initial state 



m 



(b)After one step 



Fig. 3. Push operation by a moving plate. 

The deformation of a virtual clay object is based on a push operation. The clay is 
transported from a cell into the adjacent ones along the direction of pushing. The 
surface of a virtual clay object can be pushed at most one cell in depth per step. 

Figure 3 shows an example of the push operation by a moving plate. This plate is 
modeled as a set of special cells to which virtual clay cannot be distributed. When the 
threshold is surpassed as the result of pushing, virtual clay is distributed according to 
the transition rules. The state transitions are repeated until there are no cells over 
threshold (a stable state). The number of steps to reach the stable state depends on the 
total mass of virtual clay, the threshold and the parameter a. 



4 Modeling a Virtual Clay Object in Cell-DEVS 

We used Cell-DEVS model to simulate the behavior of the deformation of a virtual 
clay object. Our focus is on a 3D free-form object, using the transition rules presented 
in the previous section. Figure 4 illustrates the 3D Margolus neighborhood we used, 
in which the nearest 8 cells make one block. Similarly to the 2D neighborhood, each 
cell belongs to different blocks in odd and even steps. 
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Neighborhood (odd step) 
.Objective Cell 

Neighborhood (even step) 



Fig. 4. 3D Margolus neighborhood 

The values in each cell represent the density (mass) of that cell. A cell with a value 
of zero means this cell is out of the object, while a positive value means the cell is 
within the object. The final state contains the free-form object in stable state after 
deformation. The transition procedure is done in two stages as follows. 

• Deformation: if there are cells with density over the threshold, the deformation 

transition rules are applied. 

• Compression: we assume that there is a moving plate and the virtual clay next 

to the plate is transferred from a cell into the adjacent cell along the direction 
of pushing. In the model, this plate is handled as a dummy plate with each 
cell having a value of zero, staying right on top of the object. 

During the transition procedure, each cell has its different neighbors at odd and 
even steps. The neighborhood of each cell is identified according to its location in the 
block. Meanwhile, each cell has different transition policies for the deformation and 
compression stages. Therefore, we used a four-dimensional Cell-DEVS model, in 
which each 3D hyperplane of (x, y, z) represents a set of state variables and the fourth 
dimension is a control variable, as follows. 

• Hyperplane 0 (x, y, z, 0) represents the free form object. 

• Hyperplane 1 (x, y, z, 1 ) defines the odd or even step so that each cell in Hy- 

perplane 0 can identify its Margolus neighborhood. 

• Hyperplane 2 (x, y, z, 2) is a control Hyperplane. Compression will be per- 

formed if (x, y, z, 2) = 1, and deformation if cell(x, y, z, 2) =0. 

This Cell-DEVS coupled model is defined as follows: 

Plastic = < Xlist, Ylist, X, Y, n, { t p ...,tj , N, C, B, Z >. 

X=Y=Xlist= Ylist= 0; S = {s| s >= 0, sG R }; «=4; f =10; t 2 = 9; t =12; t =12; 

N= {(-1,-1, -1,0), (-1,0, -1,0), (-1,1, -1,0), (0,-1, -1,0), (0,0, -1,0), (0,1, -1,0), (1,-1, -1,0), 

( 1 , 0 ,- 1 , 0 ),( 1 , 1 ,- 1 , 0 ),(- 1 ,- 1 , 0 , 0 ),(- 1 , 0 , 0 , 0 ),(- 1 , 1 , 0 , 0 ),( 0 , 1 , 0 , 0 ),( 0 , 0 , 0 , 0 ),( 0 , 1 , 0 , 0 ), 
(1,-1 ,0,0),( 1 ,0,0,0),( 1 , 1 ,0,0),(- 1 ,- 1 , 1 ,0),(- 1 ,0, 1 ,0),(- 1,1,1 ,0),(0,- 1 , 1 ,0),(0,0, 1 ,0), 
(0,1,1 ,0),( 1 ,- 1 , 1 ,0),( 1 ,0, 1 ,0),( 1,1,1 ,0),(- 1,-1 ,2,0), (- 1 ,0,2,0),(- 1 , 1 ,2,0), (0,-1 ,2,0), 
(0,0,2, 0),(0, 1 ,2,0),( 1 ,- 1 ,2,0),( 1 ,0,2,0),( 1 , 1 ,2,0),(0,0,0, 1 ),(0,0,0,2) } 

The definition of this Cell-DEVS coupled model using CD++ is illustrated in Fig- 
ure 5 below. 
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[plastic ] 

dim : (10,9,12,3) delay : transport 

border : nowrapped 

neighbors: (-1,-1, -1,0) (-1,0, -1,0) (-1,1, -1,0) 

( 0 , - 1 , - 1 , 0 ) ( 0 , 0 , - 1 , 0 ) ( 0 , 1 , - 1 , 0 ) ( 1 , - 1 , - 1 , 0 ) 

( 1 , 0 , - 1 , 0 ) ( 1 , 1 , - 1 , 0 ) (- 1 , - 1 , 0 , 0 ) (- 1 , 0 , 0 , 0 ) (- 1 , 1 , 0 , 0 ) 
localtransition : deformation-rule 
[deformation- rule] 



Fig. 5. Cell-DEVS coupled model specification in CD++ 

This Cell-DEVS model that has three hyperplanes of 10*9*12 each. The cell’s be- 
havior is defined by the localtransition clause, and it will be discussed in detail in the 
following sections. We will present the rules at deformation and compression stages 
and show how the transformation is controlled in Cell-DEVS formalism. The rules 
can be generalized as follows. 

1 . Perform deformation if cell (0, 0, 0, 2) = 0 and this cell is on Hyperplane 0. 

2. Perform compression if cell (0,0,0, 2) = 1 and this cell is on Hyperplane 0 

3. Even/odd step alternates if cell (0, 0, 0,1) = 0 and this cell is on Hyperplane 1 

4. Deformation/compression control alternates if this cell is on Hyperplane 2 

4.1 Deformation Rules in the Cell-DEVS Model 

The deformation stage involves the activating different rules at odd and even steps, in 
which we decide the different neighbors that the objective cell receives clay from or 
distributes clay to. Only hyperplane 0 (x, y, z, 0) performs the deformation transition, 
while hyperplane 1 (x, y, z, 1) helps to judge whether the cell in Hyperplane 0 change 
in odd step or in even step and hyperplane 2 (x, y, z, 2) identify the deformation 
stage. In each step its neighborhood can be located in a 2x2x2 block. In Figure 6 we 
show the mechanism to select the Margolus block. The origin cell is colored in gray 
and its neighbors are defined according to the coordinates shown in the figure. We 
repeat the procedure for other cells in the same Margolus block to obtain the neigh- 
bors of each objective cell in the same hyperplane. 
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Margolus Neighborhood in Plane 0 



|(omi)| 

Neighbor in Plane 1 



( 0 , 0 , 0 , 2 ) | 

Neighbor in Plane 2 



Fig. 6. A cell and its neighborhood definition at the deformation stage 



The pair < (x,y,z,0), (x,y,z,l) > define the step (odd or even). The neighbor on the 
hyperplane 0 is described in the figure, and its value, together with the value of all the 
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neighbors in the same Margolus block, can be used to decide which transition rule 
should be applied. The deformation rules can be generalized as follows (the detailed 
definition of these rules in CD++ can be found in [7], 

• Cells gain clay from neighbors on hyperplane (x,y,z,0) if (0,0, 0,2) = 0, 

(0, 0,0,0) is below the threshold, and at least one neighbor is above the 
threshold. 

• Cells distribute clay to neighbors on Hyperplane (x,y,z,0) if (0,0,0, 2) = 0, 

(0, 0,0,0) is above threshold and at least one neighbor is under threshold. 



4.2 Compression Rules in the Cell-DEVS Model 



Similar to deformation, the compression only takes place on hyperplane 0. Hyper- 
plane 2 controls when compression occurs: only when cell (x, y, z, 2) =1. During 
compression the clay in the cells right under the moving plate is transferred into the 
adjacent cells along the direction of pushing. The moving plate is represented by a set 
of cells with values of zero sitting on the top of the object. We assume the plate 
moves down along z-axis as shown in Figure 7. For each cell (x,y,z,0), if all neigh- 
boring cells (-1,-1, 2,0), (-1,0,2, 0), (-1, 1,2,0), (0,-1, 2,0), (0,0, 2,0), (0,1, 2,0), (1,-1, 2,0), 
(1, 0,2,0), (1,1, 2,0) are zero and at least one of neighbor cells (-1,-1, 1,0), (-1,0, 1,0), (- 
1,1, 1,0), (0,-1, 1,0), (0,0, 1,0) , (0,1, 1,0), (1,-1, 1,0) , (1,0, 1,0), (1,1, 1,0) is greater than 
zero, the cell should gain all clay in its neighbor (0,0, 1,0). The neighbors of each cell 
are defined in Figure 7. 



f z 

Pushing direction 




Y 



Neighbors in Plane 0 



(- 1 ,- 1 , 2 , 0 ) 


( 0 ,- 1 , 2 . 0 ) 


( 1 - 12 , 0 ) 


(- 1 , 0 . 2 , 0 ) 


( 0 , 0 . 2 , 0 ) 


( 10 . 2 , 0 ) 


(- 1 , 12 , 0 ) 


( 0 . 12 , 0 ) 


( 1 , 12 , 0 ) 


(- 1 - 11 . 0 ) 


( 0 ,- 110 ) 


( 1 - 11 , 0 ) 


(- 1 , 0 , 10 ) 


( 0 , 0 , 10 ) 


( 1 , 0 , 10 ) 


(- 1 , 1 , 10 ) 


( 0 , 1 , 10 ) 


( 1110 ) 



( 0 , 0 . 0 , 0 ) 



Neighbor in Plane 2 

( 0 , 0 , 0 , 2 ) 



Fig. 7. A cell and its neighbor definition at the compression stage 

The definition of these rules in CD++ is as follows: 

[compress ion- rule] 

%plate moving 

rule : {(0,0,0,0) + (0, 0,1,0)} 100 { (0,0, 0,2) =1 and 

cellpos(3)=0 and cellpos(0)>0 and cellpos(0)<9 and 
cellpos(l)>0 and cellpos(l)<8 and cellpos(2) <10 and 
((-l,-l,2,0)+(-l,0,2,0)+(-l,l,2,0)+(0,-l,2,0)+(0,0,2,0) 
+(0,1,2,0)+(1,-1,2,0)+(1,0,2,0)+(1,1,2,0))=0 and 
((-l,-l,l,0)+(-l,0,l,0)+(-l,l,l,0)+(0,-l,l,0)+(0, 0,1,0) 
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+( 0 , 1 , 1 , 0 )+( 1 , - 1 , 1 , 0 ) +( 1 , 0 , 1 , 0 )+( 1 , 1 , 1 , 0) )>0 } 

%step 1: add the first row to the second row 
rule : 0 100 {(0,0, 0,2) = 1 and cellpos(3)=0 and 
cellpos(0)>0 and cellpos(0)<9 and cellpos(l)>0 and 
cellpos(l)<8 and cellpos(O) < 11 and ( ( -1 , -1 , 1 , 0 ) + 

( - 1 , 0 , 1 , 0 ) + ( - 1 , 1 , 1 , 0 ) + ( 0 , - 1 , 1 , 0 ) + ( 0 , 0 , 1 , 0 ) + ( 0 , 1 , 1 , 0 ) + 
(1,-1, 1,0) + (1, 0, 1, 0) + (1, 1, 1, 0) ) =0 and ( ( -1 , -1 , 0 , 0 ) + 
(-l,0,0,0)+(-l,l,0,0)+(0,-l,0,0)+(0,0,0,0)+(0, 1,0,0)+ 
(1,-1, 0,0) +(1,0,0,0)+(1,1,0,0) )>0} 

%step2 : change the first row to 0, the plate has moved 
one step further 

Fig. 8. Cell-DEVS specification of the compression rules in CD++. 



4.3 Control Hyperplane in Cell-DEVS Model 

Hyperplane 2 controls when the deformation or compression stages occur. The value 
of each cell on hyperplane 2 switches between 0 (deformation) and 1 (compression). 
We assume that the transport delay of performing a compression step is 3000 ms, 
thirty times longer than the delay of a deformation. The 3D free-form object will 
reach stable state within 3000 ms if deformation occurs. 

The transition rule for the control hyperplane is as follows. 

• S 1 if cell (0,0,0, 1 )=0 and cell (0,0, 0,0) =0 and cell itself is on hyperplane 1 

• S <-0 if cell (0,0,0. 1 )=0 and cell (0,0, 0,0) =1 and cell itself is on hyperplane 1 

• S <- 1 if cell (0,0,0, 0) =0 and cell itself is on hyperplane 2 

• S <- 0 if cell (0, 0,0,0) = 1 and cell itself is on hyperplane 2 

The specification of these rules in CD++ is illustrated in Figure 9 below. 

rule : 1 100 {(0,0,0,1)=0 and cellpos(3)=l and 

( 0 , 0 , 0 , 0)=0 } 

rule : 0 100 {(0,0, 0,1) = 0 and cellpos(3)=l and 

(0,0,0,0)=1> %alternate Margolus neighborhood 
rule : 1 3000 { cellpos(3)=2 and ( 0 , 0 , 0 , 0 ) =0 } 
rule : 0 100 {cellpos (3 ) =2 and ( 0 , 0 , 0 , 0 ) =1 } 

%plate moving 

Fig. 9. Cell-DEVS specification of control rules in CD++ 



5 Experimental Results 



We executed this Cell-DEVS model, and studied the deformation process of a clay 
object. We studied each cell at different time steps (compression or deformation), and 
the total mass of the object was examined. We found that the total mass (represented 
by cells in hyperplane 0) was conserved for every transition. 
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Some of the results obtained are presented in Figure 10. We show several steps 
during the transition process, which includes the initial state, first three compression 
steps and some related deformation steps. Figure (b) shows the immediate result after 
the first compression step. Figures (c) and (d) show the object deformation. In Figure 
(d), a stable state is reached. Figure (e) to (i) show the repartition of clay after the 
second and third compression steps. 




(a) Initial state 




(d) Deformation, stable 




(g) Third compression 





(h) Deformation 





(c) Deformation 



(f) Deformation, stable 




(i) Defonnation, stable 



Fig. 10. The deformation of the free-form object using Cell-DEVS 
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6 Conclusion 

We have showed how to model the deformation of 3D free-form object using the 
Cell-DEVS formalism. The total mass of the free-form object is conserved during the 
stages of deformation. The complex behavior has been simulated using simple and 
easy-to-understand transition rules. These rules are defined using the specification 
language in CD++, which is itself is simple and easy to learn. A similar study has 
been carried out in [1] and [2], However, in their work, a sophisticated simulation 
program was developed in order to simulate this complex behavior. The simulation 
program is application-specific, which can not be applied to general cases. By apply- 
ing the Cell-DEVS formalism along with the CD++ toolkit support, more generality 
can be achieved. This allows us to model complex systems in a simple fashion. Like- 
wise, changing parameters for studying simple conditions is straightforward. Cell- 
DEVS permits defining zones of cells with different behavior, which can be used to 
analyze objects composed of different materials. Likewise, Cell-DEVS models can be 
integrated with other DEVS and Cell-DEVS models, permitting defining multimodels 
interacting with each other, using different modeling techniques. 

One of the key aspects to simulate the deformation is to understand the character- 
istics of the Margolus neighborhood. This has been studied and explored in the model 
specification. The results, which have been illustrated by a visualization tool have 
demonstrated the capability of Cell-DEVS modeling approach as well as the tool 
utilities. The behavior of the deformation is well simulated with a simple, easy- 
understood mechanism. 
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Abstract. There is increasing interest in cellular automata that up- 
date their cells asynchronously, i.e. , at random times and independent 
of each other. Research, however, has been limited to either models of 
trivial phenomena, or models that require a global synchronization mech- 
anism to induce nontrivial phenomena like computation. This paper em- 
ploys local synchronization, a technique in which particular temporal 
sequences of states are imposed locally on cells and their direct neigh- 
bors, while the exact timing of state transitions is left undetermined. A 
hexagonal asynchronous totalistic cellular automaton is presented that 
achieves a completely asynchronous way of computation by simulating 
delay-insensitive circuits, a type of asynchronous circuits that are known 
for their robustness to variations in the timing of signals. We implement 
three primitive operators on the cellular automaton from which any ar- 
bitrary delay-insensitive circuit can be constructed, and show how to 
connect the operators such that collisions of crossing signals are avoided. 
The model requires six states and 55 totalistic transition rules. 



1 Introduction 

Cellular Automaton (CA) models have been used extensively for studying dy- 
namical systems. Simulations of CAs on computers [15] suggest that the evolu- 
tion of CAs from disordered initial states can be grouped into four classes: 

I. Evolution leading to a homogeneous state, 

II. Evolution leading to a set of separated simple stable or periodic structures, 

III. Evolution leading to a chaotic pattern, 

IV. Evolution leading to complex localized structures, sometimes long-lived. 

This classification has mainly been investigated for synchronous CAs. Most dy- 
namical systems have an overall behavior that appears asynchronous, suggesting 
that synchronous CA models are not the optimal choice for simulating them. To 
make matters worse, some of the phenomena observed in synchronous CAs are 
actually artifacts of the synchronous mode of operation. 



P.M.A. Sloot, B. Chopard, and A.G. Hoekstra (Eds.): ACRI 2004, LNCS 3305, pp. 91—100, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 
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This motivates study into Asynchronously Cellular Automaton (ACA) mod- 
els. According to [5], asynchronous updating makes it more difficult for infor- 
mation to propagate through AC As, and thus tends to prevent initial perturba- 
tions from spreading. A system whose level of asynchronicity is above a certain 
threshold, rather tends to converge to a randomly fluctuating steady state [3], 
suggesting class-I and class-II behavior. How can we induce class-III and class-IV 
behavior in ACAs? Computation, generally associated with class-IV behavior, 
has been studied on ACAs, but mainly by simulating a synchronous CA on an 
ACA [9], after which a synchronous model, like a Turing Machine, is simulated 
on the synchronous CA. This method suffers from an increased number of cell 
states required for the ACA. Moreover, the ACA is de facto synchronous. ACAs 
free from these shortcomings have been proposed [1,7,12,13,14], that, rather than 
relying on global synchronization mechanisms, conduct asynchronous computa- 
tions directly by simulating Delay-Insensitive (DI) circuits (see also [4]). A DI 
circuit is a type of asynchronous circuit, in which delays of signals do not affect 
the correctness of circuit operation. Configurations are designed on the ACA of 
cells in appropriate states that behave like signals in DI circuits and like modules 
whose operations change the directions or numbers of signals. Put together in 
appropriate ways, the configurations operating on signals can be used to form 
DI circuits that conduct computations. 

This paper proposes an ACA that conducts its computations in a purely 
asynchronous way, adopting the same local synchronization method as in [1,7,12, 
13,14]. Unlike previous ACAs, however, the proposed ACA assumes a hexagonal 
neighborhood and totalistic updating — reason to call it Hexagonal Asynchronous 
Totalistic Cellular Automaton (HATCA). The DI circuits implemented on the 
ACA are based on a novel set of primitive DI elements that are especially suitable 
for implementations on hexagonal cellular spaces. Consequently, our ACA is 
more straightforward than the comparable ACA in [1], which is also totalistic 
but based on a squared cellular space with Moore neighborhood. Though both 
the ACA in this paper and the ACA in [1] require 6 states, our ACA requires 
less transitions rules (55 against 85). It is also less restricted in its asynchronous 
updating scheme, in the sense that, unlike in [1], two neighboring cells are allowed 
to undergo transitions simultaneously. In a totalistic CA like the proposed ACA, 
the total numbers of cells in certain states in a cell’s neighborhood determine the 
transitions the cell can undergo. A well-known totalistic synchronous CA is the 
Game of Life [2], which exhibits behavior that is in class-IV. Our results may 
lead to more realistic CA-based models of physical phenomena and to massively 
parallel computation models with improved physical realizability. 

Section 2 introduces DI circuits. We describe three primitive modules by 
which any arbitrary DI circuit can be constructed. In Section 3, we define 
HATCA, and in Section 4, we implement DI circuits on HATCA and describe 
the transition rules required for this. We show how to implement the signal and 
the primitive modules on HATCA, and how to connect them in circuits without 
collisions of crossing signals. We finish with concluding remarks. 
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2 Delay-Insensitive Circuits 

A DI circuit is an asynchronous circuit whose operation is robust to arbitrary 
signal delays. A Di-circuit needs no central clock since it is driven by input 
signals; it may thus be called an event-driven circuit. The circuit is composed 
of paths (wires) and modules. Signals are transmitted along the paths and are 
processed by the modules. 

Any arbitrary Boolean circuit in a synchronous system can be constructed 
from a fixed set of primitive operators, for example from AND-gates and NOT- 
gates. Such a set is called universal. Likewise, there are universal sets of prim- 
itives for Di-circuits, for example [6,11,8]. We use a universal set of primitives 
that is particularly suited for hexagonal cellular automata. It consists of the 
following three modules: 

— FORK: A module (Fig. 1(a)) that, upon receiving an input signal from its 
input path A, produces one output signal in each of its output paths B and 

C. 

— MERGE: A module (Fig. 1(b)) that redirects an input signal from either 
input path A or B to output path C. 

Arbitrating TRIA (A-TRIA): A module (Fig. 1(c)) with three input 
paths (A, B, and C) and three output paths (P, Q, and R), which has two 
functionalities. In the first functionality, it acts like a TRIA module [11]: 
when it receives one signal from A (B) [C] and one signal from B (C) [A], 
it sends an output signal to P (Q) [R]. The second functionality is similar, 
except that the A-TRIA allows arbitration when the input paths A, B, and 
C receive signals at the same time. The A-TRIA then arbitrarily selects two 
input signals and operates on them as if it were a normal TRIA, while leaving 
the third input signal pending , i.e. holding it as is. 

This set of modules is universal, that is, any arbitrary DI circuit can be 
constructed from them. As an example, a module called a SEQUENCER, is 
constructed from three FORK, two MERGE and three A-TRIA modules, as 
shown in Fig. 2. A SEQUENCER arbitrates between two input signals, one on 
ai and one on hi, letting one of them pass whenever there is an input signal 
from a third path c. If the signal from a,; (resp. b,) passes, an output signal is 
produced on a 0 (resp. b 0 ). 

3 Hexagonal Asynchronous Totalistic Cellular Automata 

A CA consists of identical finite automata arranged as cells in a regular d(d > 1)- 
dimensional array, such that they are mutually connected to their neighbors [10, 
2,7]. Each cell is in a state from a finite state set. The states in this paper, 
0, 1, 2, 3, 4, and 5, are denoted by the symbols in Fig. 3. A cell with state 
0, called quiescent cell , is commonly used for background. The state of a cell 
at time t changes to a state at time t + 1 in accordance with a local function 
/, which has the cell’s state and the neighboring cells’ states as inputs. In a 
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Fig. 1 . Primitive modules: (a) FORK, (b) 
MERGE, and (c) A-TRIA 



Fig. 2. A SEQUENCER module 
constructed from three FORKs, two 
MERGEs and three A-TRIAs. The two 
blobs (small filled circles) on the paths 
denote pending signals 
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Fig. 3. The symbols by which the cell Fig. 4. Update of a cell by a transition 
states are encoded 




conventional CA, all cells undergo transitions at the same time; in other words, 
the CA is synchronous. Consequently, the global states are uniquely determined 
at all times by the local function /. 

If cell transitions occur randomly and not necessarily at the same time, the 
resulting model is an asynchronous CA. In this paper, we use an asynchronous 
CA which includes synchronous updating, that is, neighboring cells may undergo 
transitions at the same time, but not necessarily so. This is a general case of an 
asynchronously updating scheme [9,7], and it is called completely asynchronous. 
In an asynchronous CA, the global states are not uniquely determined at all 
times by the local function /, because of the randomness by which transitions 
are timed. 

The model used is a 2-dimensional hexagonal asynchronous totalistic CA, 
that is, a hexagonal CA in which transitions are timed completely asyn- 
chronously according to totalistic transition rules. The transition function is 
a function of a cell’s state and of the numbers of neighboring cells in certain 
states: 



Co = f(c 0 ,{n k }) 
n k = |{ci|ci = k}\ , 



(1) 

(2) 
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where, Co and d are the states of cells located at 0 and i (i = 1,2, ... ,6) before 
the update, and c' Q is the state of cell 0 after the update (see Fig. 4). The variable 
rik denotes the number of cells in state k (k = 0, 1, 2, . . . , N — 1), where N is 
the number of possible states. In a completely asynchronous updating scheme, a 
transition takes place in a certain cell with a certain probability p , but only so if 
the local function is defined for the particular combination of a cell’s state and 
its neighboring cells’ states. If the local function is undefined, no transition takes 
place, even if the cell is selected by the scheme. In other words, the updating 
scheme is as follows: 

1. for all i, j: 

2. select the cell at position (i.j) 

3. determine random number q £ [0,1] 

4. if q < p and /(cy, {«.&}) is defined: 

5. do transition cb = f(dj, {u-fc}) 

6. for all i, j: replace dj by c' ; - 

4 Implementing Delay-Insensitive Circuits 

4.1 Signals 

A signal consists of a configuration of cells in certain states that moves in a 
certain direction over time. We call a continuous area of cells along which a 
signal propagates a path. The propagation of a signal requires rules 1 to 8 in 
Table 1 and is illustrated in Fig. 5. The signal in Fig. 5(a), composed of the four 
cells in states 1 and 2, propagates along a path in the eastern direction, using 
state 3 in the process. In front of the signal configuration, there is a cell in state 
0 that has five neighbors in state 0 and one neighbor in state 2. The only rule 
matching to this situation is rule 1, and applying this rule to the cell changes 
its state from 0 to 3, yielding configuration (b). After this, configuration (b) is 
transformed to one of the configurations in (c) by rule 2, and this is followed by 
the intermediate configuration in (d) due to rule 3. Subsequently, configuration 
(d) is transformed to (k) via (e)-(f)-(g) or (h)-(i)-(j). A configuration in (e) may 
also evolve into a configuration in (i), but we leave the arrows out in Fig. 5 for 
simplicity; similarly (f) may evolve into (j). 

Due to the asynchronicity of the model, the order in which the cells in Fig. 5 
undergo transitions may vary, and even subsequences of transitions different 
from those in Fig. 5 may occur. The standard form of the signal in Fig. 5(a), 
however, will always reoccur. Finally, the transition rules are designed such that 
successive signals moving along the same path always remain separated by at 
least one quiescent cell. 

4.2 Primitives 

To implement the three primitive modules {FORK, MERGE, A-TRIA} in Sec- 
tion 2 on HATCA, we use six states, of which states 4 and 5 are specifically for 
the modules. 
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Fig. 5. Development of a signal on HATCA. A number above an arrow denotes the 
applied rule. Signal configuration (a) is transformed to (b) by rule 1 in Table 1; con- 
figuration (b) is transformed in one of the configurations in (c) by rule 2; and that 
configuration is transformed to (d) by rule 3. The intermediate pattern (d) is trans- 
formed to (k) via (e)-(f)-(g), (e)-(i)-(j), (e)-(f)-(j), or (h)-(i)-(j) 



Table 1. Transition rules for propagation of a signal (No. 1-8), for FORK module (No. 
9-22), for MERGE module (No. 23-33), and for A-TRIA module (No. 34-55) 
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The FORK is composed of three cells in state 4. Upon being presented by an 
input signal, the FORK typically operates like in Figs. 6 (a)-(l). The transition 
rules required for the FORK are 9 to 22, listed in Table 1. 
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(g> {«> PI (I) M (i) 

Fig. 6. Configuration and operation of FORK module 




(9) <h) (i) (I) M (l) 



Fig. 7. Configuration and operation of MERGE module 



The MERGE is composed of two cells in state 4 and one cell in state 5. Upon 
being presented by an input signal, the MERGE typically operates like in Figs. 7 
(a)-(l). The transition rules for the MERGE are 23 to 33, listed in Table 1. 

The A-TRIA is composed of one cell in state 5. Its TRIA functionality is 
illustrated in Fig. 8: upon being presented by two input signals, these signals 
are joined such as to produce one output signal. The arbitration functionality 
of the A-TRIA is illustrated in Fig. 9: upon being presented by three input 
signals, two of the signals — chosen arbitrarily due to the asynchronous mode of 
updating — are joined to produce one output signal, while the third input signal 
is kept pending. The transition rules required for the A-TRIA are 34 to 55, listed 
in Table 1. 



4.3 Crossing Signals 

Realizing a particular DI circuit is a matter of connecting the primitives to each 
other by paths in an appropriate way and laying them out on the CA. Special 
care is required for paths that cross each other, since the CA lacks a third 
dimension to cross signals. If crossing signals collide, they need to resolve which 
of them may pass first. Though most circuits can be designed such that crossings 
are collision- free, in some circuits it is hard to guarantee this. It is possible to 
add rules for crossing signals, but there is a more elegant way based on a module 
specialized in crossings [1,7,12,13] that is similar to a SEQUENCER. 
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Fig. 8. Configuration and operation of A-TRIA module when being presented by two 
input signals. The signals are joined, resulting in one output signal 





(k) (I) (m) (n) (o) (p) 



Fig. 9. Configuration and operation of A-TRIA module when being presented by three 
input signals. Two of the signals are arbitrarily selected, and joined, while the third 
signal is kept pending. The evolutions into the three possible outcomes are shown fully 
in (c)-(p) and briefly in (c)-(q) and (c)-(r) 



The module is realized using only our primitives and collision-free path cross- 
ings. The implementation of the crossing on HATCA is shown in Fig. 10. A signal 
arriving at a,; (resp. bi) gives rise to an output signal at a 0 (resp. b 0 ). In case 
signals arrive at a 0 and b 0 simultaneously, only one is allowed through by the 
A-TRIA just left of the center. To this end, the A-TRIA tries to join one of 
these signals with the control signal, input at its right side. If this fails, i.e., if 
the A-TRIA rather joins a 0 with 6 0 , a feedback signal is generated that gives 
rise to a new a Q and b Q . This procedure is repeated until one of a Q and b Q gets 
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Fig. 10. Implementation of a delay-insensitive crossing module on HATCA. The arrows 
show a typical example of a signal passing through the crossing from ai to a 0 



through. The crossing module can be used under all circumstances: the order by 
which the signals arrive at its inputs is irrelevant for its correct functioning. 

5 Concluding Remarks 

To compute in a purely asynchronous way on an asynchronous hexagonal cel- 
lular automaton, we have embedded DI circuits on it. As we can construct any 
particular DI circuit by connecting the primitives proposed in this paper in ap- 
propriate ways, a universal Turing Machine can be build [8]. Other examples on 
how to build simple circuits on ACAs are given in [1,7,12,13]. A discussion on 
the implementation of DI circuit-based program structures, such as for-loops, 
etc., on ACAs can be found in [12]. Since computation is within the reach of the 
proposed ACA model, it may be considered as belonging to class-IV if Wolfram’s 
classification is extended to asynchronous systems. 

In the approach followed in this paper and in [1,7,12,13,14], synchronization 
is done locally, as opposed to the traditional approach of simulating a globally 
synchronized CA on an ACA. A cell will only undergo a transition if there is a 
transition rule that matches the state of the cell and the states of its neighbors. 
If there is no such match, the transition cannot be accomplished, and the cell 
will wait until one or more of its neighbors change states such that a match 
ensues. As this mode of operation is often found in nature, where for example 
chemical reactions only occur when the right molecules are available in the right 
positions at the right moments, this approach may lead to improved ways to 
simulate physical systems by ACAs. 
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Abstract. Motivated by the wish to make large simulations of Bak’s 
sandpile model we investigate a simple method to simulate cellular au- 
tomata with “few activities” efficiently on a computer. 



1 Introduction 

Simple ideas are sometimes more useful than complicated ones - if they are 
noticed. In this paper we pursue a simple idea to simulate cellular automata 
with small state change complexity [4] on a computer. 

Self-organized criticality is a phenomenon which several systems and models 
of them exhibit. One of the prototype examples is the sandpile model by Bak, 
Tang and Wiesenfeld [2] . The CA part of this model has the property that most 
of the time only very few cells change their states even if the lattice is large 
(which is the interesting case). 

In such a situation the straightforward simulation technique of computing the 
new state for each cell in each global step does a lot of superfluous work: When 
a cell p and all of its neighbors do not change their states during a global update 
of a CA one can be sure that p will again not change its state during the next 
global update. One could at least hope that in such a situation restricting the 
simulation to cells where a state change might happen would pay off. We call 
this the adaptive simulation algorithm. Depending on how little or how much 
bookkeeping work is necessary it will be competitive for configurations with 
relatively many or few “active” cells. 

The rest of this paper is organized as follows: In Section 2 we quickly review 
the definition of CA (thus introducing the notion we will use in the sequel) and 
explain the sandpile model. In Section 3 we describe the adaptive simulation 
algorithm, prove its correctness and discuss some implementation issues. In Sec- 
tion 4 results of simulation runs for different CA are reported and evaluated. We 
conclude the paper in Section 5. 



2 Basics 

2.1 Cellular Automata 

We assume that the reader is familiar with the concept of deterministic syn- 
chronous cellular automata. We use the following notations. 
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By R we denote the regular grid of positions where the cells are located. We 
assume that R is an additive group with neutral element 0. We use x, y , 2 for 
the coordinates of cells. The finite set of states is always denoted as Q. Thus a 
(global) configuration is a mapping c : R — > Q, or c £ Q R for short. We usually 
write c x instead of c(x) for the state of cell x. The local neighborhood N is a 
finite set of differences of coordinates. We write x + N = {x + n\ n£ N} for the 
coordinates of all neighbors of cell x. Analogously x — N is the set of coordinates 
of all cells for which a; is a neighbor. In general we do not require 0 £ N, and 
we write N 0 for the set N U {0}. 

A mapping l £ Q N is a local configuration. Given a global configuration c 
we write c^+jv for the local configuration observed by cell x, i.e. c x +n '■ N — » Q : 
n 1 ^ c x + n . The local transition function is a mapping 6 : Q N — > Q. It induces 
the global transition function A : Q R Q R by the prescription that for all 
x € R must hold: A(c) x = S(c x+ n). 

2.2 Sandpile Model 

The sandpile model by Bak, Tang and Wiesenfeld [2] actually consists of two 
parts. One is a CA, the other is a sequence of “external events”. We will only 
consider the original simple version. 

The CA part works as follows. It is a two-dimensional CA using von Neumann 
neighborhood of radius 1. The set of states is Q = {0, 1, 2, 3, 4, 5, 6, 7}. The local 
rule can be described like that: Each cell is a little bin which contains a certain 
number q £ Q of grains of sand. In each step each cell does the following: 

— If a cell is critical , i.e. q > 4, then “it takes 4 of its grains of sand and puts 
one in each of its 4 neighboring bins.” This is called the toppling of a cell. 

— If a cell is not critical, i.e. q < 3, then “the cell does nothing” by itself. (It 
may still receive one or more grains from its neighbors.) 

One does not use cyclic boundary conditions. Instead, if for example a cell at the 
left border tries to put a grain of sand to its left neighbor, the grain simply “gets 
lost” . That way one can always be sure, that no matter what the initial configu- 
ration looks like, after a finite number of CA steps one will reach a configuration 
in which all cells are not critical. 

Given such a configuration one repeatedly does the following: 

1. Uniformly one of the cells is chosen at random and one grain of sand is added 
to it. 

2. Starting at this cell there may be an avalanche. As long as there is at least 
one critical cell, the CA does one global step. 

There are several quantities which could be used as the size of an avalanche: the 
number of global CA steps needed to reach a quiescent configuration, the total 
number of cells “involved” in an avalanche or total number of cell topplings. For 
the figures in this paper we have used the number of global CA steps. 

The reason why for example geophysicists are interested in this (and similar 
more advanced) models is that the distribution of frequencies of avalanches of 
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different sizes is similar to the distribution of frequencies of earthquakes of dif- 
ferent magnitudes. This is true at least if one has already reached a global state 
of so-called self- organized criticality. 

3 Simulation Technique 

The conventional algorithm for simulating a CA works with two global configu- 
rations c and c! . In c the current states of all cells are stored. By iterating over 
all cells x £ R for each of them the new state is computed and stored in d. At 
the end the roles of c and c' are exchanged to prepare for the simulation of the 
next global step. 

Algorithm 31 (Conventional simulation method for one global CA 
step). 

foreach x £ R do 

c 'x 3 ( c x+n) 

od 

c o c' 

This algorithm always computes new states for all cells. It does so even for cells 
which are inside a neighborhood where no actual state changes (from s to some 
s' s) took place recently. 

3.1 Abstract Version of the Adaptive Algorithm 

The following algorithm tries to be more clever. It maintains a set A C R of 
cells, such that whenever x A one can be sure that one need not compute a 
new state (because c' already contains it). 

Here is the algorithm, already extended by a few assertions needed for its 
verification: 

Algorithm 32 (Adaptive simulation method for one global CA step). 

A1 ( dx e R : x £ A ==> c' x = c x Ac x = S(c x+N ) ) 

foreach ieido 

C x ${ c x+n) 
if c' x y^ c x then 

foreach y £ x — Nq do 
A! £- A! U {y} 

od 

fi 

od 

A2 ( d = Z\(c), i.e. dx £ R: d x = A(c) x } 

A3 ( Vx £ R : x ^ A' => d x = c x A d x = 5(d x+N ) ) 

c d 

A £- A'; A' <- 0 

A4 (dx £ R: x <£ A => d x = c x Ac x = 5(c x+N ) ) 
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3.2 Correctness 

Assume that assertion A1 is correct, and consider an arbitrary x £ R just before 
the foreach a; € A do loop is entered. 

Case 1: x £ A. Then the loop body is executed for x. Obviously one gets d x = 
5{c x+ n) = A(c) x as claimed in assertion A2. 

Case 2: x £ A. In this case already by assertion A1 one can be sure that d x = 
S(c x+N ) = A(c) x holds. 

Therefore assertion A2 is true. 

Now consider the set A! after the foreach z € A do loop has finished and 
an arbitrary y £ R: 

Case 1 : y £ A! . In this case there is no need to prove anything. 

Case 2: y ^ A ! . This can only happen if ->3x £ R : y £ x — N 0 A d x ^ c x . This 
is equivalent to ~Ax £ R : x £ y + N 0 A d x ^ c x , and this to Va : £ R : x ^ 
y + N 0 V d x = c x . 

Since y £ y + Nq one has d y = c y . 

Furthermore d x = c x holds for all a: £ y + N, i.e. d y+N = c v +n, and therefore 
<5( c y+Ar) = H c v+n)- We already know from A2 that A(c) y = d y , hence 
<5(c' y+Ar ) = d y . 

Therefore assertion A3 is true. 

Assertion A2 tells us that the algorithm really computes d = Z\(c). 

We note that the 0 is really necessary in the loop foreach y £ x — Nq do . 
If a neighborhood does not contain 0 one would otherwise get wrong results in 
general. One can easily get this detail wrong. 

If one uses the above algorithm inside another loop to simulate several global 
steps of a CA, assertion A1 is an important loop invariant. Because of the state- 
ments c ££ d and A <— A' from assertion A3 one gets assertion A4 immediately. 
And this is in fact identical with assertion Al. 



3.3 Implementation in C 

For the measurements reported in the next section, we used an implementation 
of the above algorithm written in C. For the tests we used implementations of 
local CA rules for a shift and the sandpile model. 

In both cases the state of each single cell was stored in char variable. Thus 
global configurations were char arrays. Because we wanted to run computations 
for different R , the memory for them were malloced one-dimensional char arrays 
and the transformation to two-dimensional coordinates was done by hand. 

See Subsection 4.3 below for remarks on different storage schemes. 

The most important aspect is an efficient implementation of the (mathemati- 
cally speaking) sets A and A! of active cells. We used the following approach: For 
A and A! one-dimensional arrays currActList and nextActList (respectively) 
of elements are malloced in which cell coordinates are stored. 
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— Variables currActlndex and nextActlndex are both initialized with 0. 

— Adding the position p of a cell, its coordinates for short, to a list is done by 
a statement like 

nextActList [nextActIndex++] = p; 

In order to avoid that the coordinates of one cell are added to the activity list 
several times, there is an additional char array actArray which stores one bit 
of information for each cell: 

— If for some index p the bit actArray is 0, this means that p has not yet been 
added to nextActList. 

— If for some index p the bit actArray is 1, this means that p has already been 
added to nextActList. 

Therefore actArray [p] has to be updated when p is added to nextActList. 
Without going into too many details, the C code basically looks as follows: 

for (i=0; iCcurrActlndex; i++) { 
p = currActList [i] ; 
nextConf [p] = delta(currConf , p) ; 
if (nextConf [p] ! = currConf [p] ) { 

for all q in p-NO { 

if (! act Array [q] ) { 

nextActList [next Act Index++] = q; 
actArray [q] = 1 ; 

> 

> 

> 

} 

swap the pointers currConf and nextConf ; 
for (i=0; iCnextActlndex; i++) { 
actArray [nextActList [i] ] =0 ; 

> 

swap the pointers currActList and nextActList; 
currActlndex = nextActlndex; 
nextActlndex = 0; 

Already in the conventional simulation algorithm one has to be careful when 
trying to access “neighbors” p + n of border cells p. Often this problem can be 
dealt with by allocating additional memory for border “ghost cells” which are 
set to their proper values outside the global CA step. 

In the adaptive algorithm an additional similar problem comes up when 
looking at the potential cells p — n of which a cell p is a neighbor. In order to 
avoid lots of if statements for testing whether p — n is still within bounds, we 
used a method inspired by the idea of localization suggested in [6] . There is yet 
another array localize with ghost cells holding for each cell the information 
whether it is within the bounds of the real CA lattice. Indexing it with p — n 
gives the information whether it should be added to A! . 
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For meaningful results currActList has to be initialized correctly, i.e. asser- 
tion A1 has to be satisfied. In the case of the shift we simply included all cells in 
currActList. In particular in the case where there usually are only few active 
cells, this spoils the total running times for a not too large number of global 
steps. 

Figure 1 shows an example. For two initial configurations with 100 and 10 000 
ones the average time needed for one global step is determined by making an 
increasing number k of global steps and dividing the total running time by k. 




number k of iterations 

Fig. 1 . Average time needed for one global step computes by averaging over an in- 
creasing number of iterations for two different initial configurations 



Readers may have noticed that the above implementation is memory inten- 
sive. It may happen that all cells are active during several global steps. Hence 
currActList, nextActList, actArray, and localize all need the same amount 
of memory as currConf and nextConf . 

In the implementation of the sandpile model we use a pseudo random number 
generator by L’Ecuyer which his known to be very good [3, p. 280]. 

4 Simulation Results 

In this section we compare the running times of the conventional and the adap- 
tive simulation method for a few example CA. 

All actual numbers have been obtained on a Pentium 4 with 3 GHz clock 
frequency with 2 GB of RAM running Debian GNU/Linux with a 2.6.x kernel. 
The programs had been compiled with gcc version 3.3.3 using option -02 (other 
optimizations didn’t help). Running times are taken using the gettimeofday () 
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function. It is called immediately before and after the computations to be bench- 
marked and the time difference is reported. Thus we only report wall clock times 
which in general do not exactly represent the running times of the process we 
interested in but may be spoilt by activities of the operating system and/or other 
system or user processes. 

What we do want to get across in this paper is that there are interesting 
applications which benefit significantly from the adaptive simulation algorithm. 
As will be seen below, significantly here means a speedup factor in the range of a 
few thousands. In the light of this we do not really care whether concrete running 
times are off by a few percent due to OS activities. (Similarly optimizations of 
the source code could also affect the speedup factor.) 

4.1 Shift 

The first CA we investigate simply does a shift of all cells “down one row” in 
each step. Each state was stored in a variable of type unsigned char. 

All simulation results reported in this and the next subsection have been 
obtained for CA with 1000 x 1000 cells and cyclic boundary conditions. Times 
were taken for 10 000 global iterations and then divided by this number. This 
eliminates deviations which are due to operating system activities and the like 
to a large extent. 

Different initial configurations were chosen, where a randomly chosen subset 
(of the proper size) of cells was set to 1. 

For the conventional simulation method the number of ones in a configuration 
of course has no influence on the time needed for a global step. But for the 
adaptive method it has. As can be seen in Figure 2 for our implementations 
of shift down there is a break-even point at about 0.09%. Since the number of 
active cells in each global step is twice the number of 1-cells this indeed means, 
that the adaptive algorithm is faster only as long as the number of active cells 
is really small. At first this looks not very promising. But two important facts 
have to be taken into account: 

1. There are interesting CA like the sandpile model where the number of active 
cells usually is really small. 

2. In the case of the shift CA the number of active cells must be small for the 
adaptive algorithm to be competitive because the conventional implementa- 
tion of the local rule is extremely efficient; we can use a simple memcpyO 
for it. If in the conventional case the local rule takes more time, then the 
overhead of the adaptive algorithm becomes less disadvantageous. 



4.2 Sandpile Model 

In Section 2.2 we mentioned that for the sandpile model to produce results which 
are relevant for the application one must already have reached a configuration 
which exhibits self-organized criticality (SOC). To give a vague impression of 
what this means the following experiment was carried out. We started with a 
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Fig. 2. Average running times of the shift CA depending on the fraction of cells (in 
percent) in state 1 in the initial configuration for the conventional (“conv” line) and 
the adaptive (“adap” line) algorithm 



completely “empty” configuration where all cells were in state 0. Then grains 
of sand are thrown onto it. For each batch of 1000 grains we record the sum 
(over all grains) and the maximum (for one grain) of global CA steps and the 
CPU computation time needed for the batch as well as the relative frequencies 
of states 0, . . . , 3 after it. The results are shown in Figures 3 and 4. There 
is a quite sharp change of behavior which occurs for 1000 x 1000 arrays after 
approximately 2 140 000 grains. (For the curious: Experiments with other array 
sizes indicate that in general the point is at about 2.140 times the number of 
cells. But these investigations are off-topic in this paper.) 

The speedups for the adaptive over the conventional algorithm depend on 
the type of configuration, when started with an empty array, during the first 
batches we observed speedups between 7000 and 8000 for a 1000 x 1000 lattice. 
For a SOC configuration speedups were between 100 and 120. 

4.3 Assessing Feasibility and Compatibility with Other Methods 

The adaptive simulation method is faster than the conventional one whenever 
there are relatively few locations with activity. Besides the sandpile model this 
includes for example the game of life, some Ising models, and models where set 
of “micro agents” moves around on a lattice, e.g. [1]. 

There are already some methods known to speed up the simulation of CA. 
One possibility is parallelization, which may come in different flavors. 

If one has several processors, domain decomposition is a standard approach. 
It gives reasonable speedup numbers as long as the lattice is large. But what one 
does speed up is only the conventional simulation method (which may be slow 
as we have seen above). 
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Fig. 3. Sum (top) and maximum (middle) of absolute numbers of global CA steps and 
CPU running times for the adaptive method (bottom) in seconds per batch: possibly 
a phase transition during the transient period of the sandpile model before (or when?) 
reaching SOC. 




Fig. 4. Relative frequencies of states in quiescent configurations: possibly a phase tran- 
sition during the transient period of the sandpile model before (or when?) reaching 



SOC. 



Another method to process several cell updates simultaneously is to store 
several states in each of the native machine word processed by the CPU. For 
example recently Weimar [5] has reported remarkable results using multispin 
coding. 
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Concerning the latter it is more or less obvious that the adaptive simulation 
algorithm can be used together with it and that one can expect some additional 
savings. 

On the other hand, an efficient application of the adaptive algorithm in a 
setting with a real parallel machine is less trivial. For reasonable results one 
probably needs a CA where there a few activities and these should be spread 
out fairly regularly over the whole lattice. 

5 Conclusion and Outlook 

We have demonstrated that the simple idea of looking only at the active cells and 
their neighbors for the computation of a global CA step can result in impressive 
speedups over the conventional simulation method. Of course, this is only the 
case if there are relatively few active cells. 

Most probably other people have used this approach before us. The main 
contribution here is an elegant description with verification of its correctness. 
During systematic investigations we also discovered what may be a phase tran- 
sition during the initialization phase of the sandpile model as a byproduct. 

We stress again that there are interesting applications having the required 
property of few activities in at least most global steps. We have shown a few mea- 
surements for the sandpile model. Others will be investigated in the future. We 
will also have to have a look at efficient parallelization of the adaptive simulation 
method which is not completely straightforward. 
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Abstract. An experimental analysis of the asynchronous version of the 
“Game of Life” is performed to estimate how topology perturbations 
modify its evolution. We focus on the study of a phase transition from an 
“inactive-sparse phase” to a “labyrinth phase” and produce experimental 
data to quantify these changes as a function of the density of the initial 
configuration, the value of the synchrony rate, and the topology missing- 
link rate. An interpretation of the experimental results is given using the 
hypothesis that initial “germs” colonize the whole lattice and the validity 
of this hypothesis is tested. 



1 Introduction 

Cellular automata were originally introduced by von Neumann in order to study 
the logical properties of self-reproducing machines. Following Ulam’s suggestions, 
the requirements he made for constructing such a machine was the discreteness 
of space using cells, discreteness of time using an external clock, the symmetry 
of the rules governing cells interaction, and the locality of these interactions; it 
resulted in the birth of the cellular automaton (CA) model. In order to make 
the self-reproduction not trivial he also required that the self-reproducing ma- 
chine should be computation-universal (e.g., [13]). The resulting CA used 29 
elementary states for each cell and updates used 5 neighbors. Later on, Conway 
introduced a CA called “Game of Life” or simply Life which was also proved to 
be computation universal [2] . This CA is simpler than von Neumann’s in at least 
two ways: the local rule uses only two states and it can be summarized with by 
sub-rules (birth and death rules). 

However, the question remained open to know what is the importance of 
perfect synchrony on a CA behavior. Indeed, since the first study on the effects 
of asynchronous update carried out by Ingerson and Buvel [6], many criticisms 
have been addressed to the use of the CA as models of natural phenomena. 
Some authors investigated, using various techniques, how synchrony variations 
changed CA qualitative behavior [6], [10], [3], [5], [14], [8]. All studies agree on the 
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fact that for some CA, there are situations in which small changes in the update 
method lead to qualitative changes of the evolution of the CA, thus showing the 
need for further studies of robustness to asynchronism. Similarly, some authors 
investigated the effect of perturbing the topology (i.e., the links between cells) 
in one dimension [11] by adding links, or in two dimensions with small-world 
construction algorithms [15], [9]. Here too, the studies showed that robustness 
to topology changes was a key factor in the CA theory and that some CA showed 
“phase transitions” when varying the intensity of the topology perturbation. 

The aim of this work is to question, in the case of Life, the importance of 
the two hypotheses used in the classical CA paradigm: what happens when the 
CA is no longer perfectly synchronous and when the topology is perturbed? In 
Section 2, we present the model and describe the qualitative behavior induced 
by the introduction of asynchronism and/or topology perturbations. In Section 
3.1, we observe that (i) Life is sensitive to asynchronism; (ii) robust to topology 
perturbations and (iii) that the robustness to asynchronism is increased when the 
topology characteristics become irregular. Section 3.2 is devoted to presenting a 
rigorous experimental validation and exploration of these phenomena for which 
a potential explanation based on the notion of “germ” development is discussed 
and studied in Section 4. 



2 The Model 

Classically, Life is run on a regular subset of Z 2 . For simulation purposes, the 
configurations are finite squares with N x N cells and the neighborhood of each 
cell is constituted of the cell itself and the 8 nearest neighbors (Moore neighbor- 
hood) . We use periodic boundary conditions meaning that all cell position indices 
are taken in Z/iVZ. The type of boundary conditions does play an important 
role at least for small configurations as shown in [4], 

Life belongs to the outer-totalistic (e.g., [12], [11]) class of CA: the local 
transition rule / is specified as a function of the present state q(c) and of the 
number 5i(c) of cells with states 1 in the neighborhood. The Life transition 
function f{q, Sf) can be written: 

/( 0, Si) = 1 if Si = 3 ; /( 0, Si) = 0 otherwise, (birth rule) 

/( 1, Si) = 1 if Si = 2 or Si = 3;/(l, Si) = 0 otherwise. (death rule) 

In the sequel, we consider Life as an asynchronous cellular automaton (ACA) 
acting on a possibly perturbed topology. 

There are several asynchronous dynamics: one may, for example update cells 
one by one in a fixed order from left to right and from bottom to top. This update 
method is called “line-by-line sweep” [14] and it has been shown that this type 
of dynamics introduce spurious behaviors due to the correlation between the 
spatial arrangement of the cells and the spatial ordering of the updates. These 
correlations can only be suppressed with a random updating of the cells. In 
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this work, we choose to examine only one type of asynchronism which consists 
in applying the local rule, for each cell independently, with probability a. The 
parameter a is called the “synchronicity” [5] or the synchrony rate ; one can also 
view it as a parameter that would control the evolution of a probabilistic cellular 
automata (PCA) where the transition function results in applying Life rule with 
a probability a and the identity rule with probability 1 — a. 

We choose to perturb topology by definitely removing links between cells. Let 
Go = (£,Eo) be the oriented graph that represents cells interactions: (c,c') € 
E if and only if c' belongs to neighborhood of c. The graph with perturbed 
topology G = (£, E) is obtained by examining each cell c £ C and, for each 
cell in the neighborhood of c and removing the link (c, d) with a probability e~ ; 
the parameter e~ is called missing-link rate. Note that, as the local function is 
expressed in an outer-totalistic mode, we can still apply it on neighborhoods of 
various sizes. The definition we use induces an implicit choice of behavior in the 
case where a link is missing : the use of S\ in the local rule definition implies 
that the cell will consider missing cells of the neighborhood as being in state 0. 
Other choices would have been possible; for example assuming this state to be 
1 or the current value of the cell itself. 

3 Observations and Measures 

3.1 Qualitative Observations 

Figure 1 shows that the behavior of Life depends on the synchrony rate a: a phase 
with labyrinthine shapes appears when a is lowered. Bersini and Detours studied 
this phenomenon and noticed that the asynchronous (sequential) updating of 
Life was significantly different from the (classical) synchronous version in that 
sense that a “labyrinth phase” (denoted by LP) appeared (see Fig. 1 below). For 
small lattice dimensions, they observed the convergence of this phase to a fixed 
point and concluded that asynchrony had a stabilizing effect on Life [3]. 

The phase transition was then measured with precision by Blok and Berg- 
ersen, who used the final density (i.e, the fraction of l’s sites) as a means of 
quantifying the phase transition. They measured the value a c for which the 
phase transition was to be observed and found a c = 0.91 [5]. They showed that 
the type of phase transition is continuous (or a second-order transition): when a 
is decreased from a = 1.0 to a = a c , no change is observed in terms of the values 
of the average density. When we have a < a c the “labyrinth phase” gradually 
appears and the average density starts increasing in a continuous way. It is thus 
the derivative of the density that shows discontinuity rather than the function 
itself. 

Figure 2 shows that the removal of links between cells does not qualitatively 
perturb the aspect of the final configurations attained. So, according to the 
observation of steady states, synchronous Life seems somehow robust to topology 
perturbations. However, we also noticed that the transients are much shorter in 
presence of topology errors: for TV = 50 x 50, the order of magnitude of transients 
are T = 1000 for e~ — 0 and T = 100 for e~ = 0.1. 
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Fig. 1. Life configurations for IV = 50 x 50, after T = 100 time steps, starting from 
a random configuration of density dim = 0.5. In the sequential updating, cells are 
randomly updated one after another. 
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Fig. 2. Life configurations for synchronous evolution (a = 1.00) with TV = 50 x 50, 
after T = 1000 time steps, starting from a random configuration of density dim = 0.5. 




e"=0 e -=0.05 t~ = 0.10 



Fig. 3. Life configurations for IV = 50 x 50, after T = 1000 time steps, starting from a 
random configuration of density dim = 0.5: (up) a = 0.90 (middle) a = 0.75 (bottom) 
a = 0.50. The figure in the upper-left corner shows that the system is still in a transient 
mode. 
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Figure 3 shows what happens when both asynchronism and topology pertur- 
bations are added. Rows of Fig. 3 display the behavior with a fixed synchrony 
rate a and columns display the behavior with a fixed missing-link rate e~ . We 
see that increasing topology errors from e~ = 0 to e~ = 0.05 makes the phase 
transition occur for a higher value of synchrony rate a c . With a further increase 
from e~ = 0.05 to e~ = 0.10, the phase transition cannot be observed any more, 
at least for the selected values of a. 

This demonstrates that both parameters e~ and a control the phase tran- 
sition between the “inactive-sparse phase” [9] and the “labyrinth phase” (LP). 
The next section is devoted to quantitatively measure the interplay of these two 
control parameters. 



3.2 Quantitative Approach 

To detect the apparition of the labyrinth phase (LP), we need to look at the 
configurations by eye or to choose an appropriate macroscopic measure. Clearly, a 
configuration in LP contains much more l’s than a configuration in the “inactive- 
sparse phase” ([9]). This leads us to quantify the change of behaviour using 
the measurement of the “steady-state density” (i.e. the average density after a 
transient time has elapsed). This method has been chosen by various authors 
(e.g. Blok and Bergersen [5]) and it has been applied to exhaustively study both 
the dynamics [7] and the robustness to asynchronism [8] of one dimensional 
elementary cellular automata. 

We define the steady-state density p{d ln i 1 a) using the sampling algorithm 
defined in [8]: Starting from a random initial configuration constructed with 
a Bernoulli process of parameter G?; n j, we let the AC A evolve with a synchrony 
rate a during a transient time Transient then we measure the value of the density 
during a sampling time Templing- The value of p is the average of the sampled 
densities. 

The sampling operation results in the definition of a function p{d 1VL i 1 a) that 
can be represented in the form of a “sampling surface” . This surface contains part 
of the information on how the behaviour of a CA is affected by asynchronism. 
Figure 4 shows the experimental results obtained for N = 50 x 50, Transient = 
1000 , T samp ii n g = 1000. The transient time Transient is chosen according to the 
observations made in [1] where equivalent transient times were found for greater 
lattice sizes. 

Let us first look at what happens for d ini £ [0.2, 0.8] (right column of Fig. 4). 
The invariance of the surface relatively to the di n ;-axis shows that the macro- 
scopic behavior of Life does not depend on the value of this parameter within 
this range. The upper right corner of Fig. 4 shows that for e~ = 0 (regular 
topology), the phase transition occurs for a c ~ 0.90 as expected [5]. However, 
when increases, experiments show that a c also decreases. This means that the 
settlement of LP becomes more difficult as links are removed; this can be inter- 
preted as an increase of the robustness to asynchrony. We can observe that for 
e~ = 0.10 , the surface is flat and horizontal, which means that the behavior is 
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not anymore perturbed by asynchronism (at least if we consider our observation 
function) . 

The left column of Fig. 4 shows the behavior of Life for d - m ; £ [0,0.2]. We 
observe two different abrupt change of behaviors. On the one hand, there is a 
value of d? ni which separates the “inactive-sparse phase” and LP. On the other 
hand, the value of df ni increases as e~ increases. This means that LP becomes 
more difficult to reach when links are removed; which again can be interpreted 
as a gain of robustness. 

Experiments were held for various lattice sizes and allowed to control that 
the sampling surface aspect was stable with N; we however observed that when 

and a are fixed, the value of df ni is a decreasing function of N. 

All the previous phenomena may be the consequence of multiple intricate 
factors. In the next section, we study the evolution of so called “micro-confi- 
gurations” put in an empty array and propose a first hypothesis in the direction 
of understanding these behaviors. 



4 Micro-Configurations Analysis 

4.1 Experiments 

The observation of the settlement of LP shows that it can develop from very 
localized parts of the lattice and then spreads across the lattice until it fills it 
totally. By analogy with a crystal formation, we call “germs” these particular 
configurations that have the possibility to give birth to an invasive LP. We inves- 
tigate the existence of germs by performing an exhaustive study of the potential 
evolution of micro-configurations, i.e. 3x3 configurations that are placed in an 
empty array. There are 512 such configurations and we experimentally quantify, 
for each one, the probability that a it becomes a germ. Our goal is to infer the 
behavior of the whole structure from the evolution of these micro-configurations. 

Setting the synchrony rate to a = 0.5, we used the following algorithm: 

For every micro-configuration i £ I, (a) we initialize the lattice N x N with i, 
and (b) we let the CA evolve until it reaches a fixed point or until it reaches 
LP. We repeat S = 1000 times operations (a) and (b) for the same initial micro- 
configuration but for a different update histories. We consider that the CA has 
reached LP if the density is greater of equal than a limit density doo = 0.1. 
Indeed, we observed that if the CA was able to multiply the number of l’s from 
the micro-configuration to a constant ratio of the lattice, then it will almost 
surely continue to invade the whole lattice and, asymptotically, reach LP. We 
experimentally obtain the probability P ger m['i] that a configuration i is a germ. 
Grouping micro-configurations by the number k of l’s they contain, we obtain 
an array with 9 entries P ger m(fc) A £ [0, 9], displayed in Table 1. 

Results show that for k < 3,P germ (^) = 0, which means that all such micro- 
configurations always tend to extinction. For n > 3, the probability to reach LP 
increases as n increases. 
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Table 1. Probability P gelln (k) for a micro-configuration to be a germ (in %) as a 
function of the number k of l’s for different missing- link rates e~ (in %). 
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Fig. 5. Probability to reach LP as a function of rfini for different lattice sizes N £ 
{20,30,40,50} and different missing-link rates: t~ = 0, = 0.02, t~ = 0.04 from top 

to bottom. Left column shows theoretical curves calculated with the independent-germ 
hypothesis; the right column shows the actual measurements. 
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4.2 Inferring Some Aspects of the Global Behavior 

Our idea is to use the previous results about germs to extend them to a descrip- 
tion of the global behavior of Life. Unfortunately, we are not able to do that 
in the exact way. In a first approximation, let us assume that we can infer this 
global behavior by approximating the probability Plp for a uniformly initial- 
ized system to reach LP using an “independent-germ hypothesis”: interactions 
between potential germs are neglected and we assume that LP is reached if and 
only if there is at least one cell that gives birth to LP. This results in the appli- 
cation of formula : Plp = 1 — (1 — Plpi) n * n where Plpi is the probability that 
one cell gives birth to LP. We have Plpi = Sfc=o -Pgerm (k) • P app (fc, cpii), where 
Papp(fc, dini) is the probability that a micro-configuration initialized randomly 
with di n i contains k l’s. It is simply obtained by applying the binomial formula: 
Papp(Mini) = (®Hr/(l - d ini ) 9 ~ k . 

Calculated and experimental values of PLp(dini,£ _ ) are given in Fig. 5 and 
show that this assumption is justified in a first approximation even if the pre- 
dictions seem more accurate for small values of e - . 

The germ hypothesis allow us to understand better some of the observed 
behavior. Let us first consider the abrupt change of behavior observed for d; n i £ 
[0,0.20]: this can come from the fact that as G?; n ; increases the probability to 
observe a micro-configuration that contains more l’s increases thus increasing 
the probability to find a germ in the initial configuration. We can also understand 
with this point of view the invariance of the sampling surfaces in the di n i-axis 
with di n i > 0.20 by the fact, observed in Fig. 5 that in this case Plp ~ 1, that is 
the “labyrinth phase” always appears. In the same way, the shift of d[ ni observed 
in Fig. 4 when varying e~ can be explained by the looking at the variations of 
Pgerm(fc) with e~ : we see that all probabilities to reach LP decrease when e~ 
increases. Finally, we are able to qualitatively predict the scaling of df ni with 
the lattice size TV from the plots the function Plp (dim, e _ ) : when dj n ; and Plp 
are relatively small (i.e., Plp < 0.1), we have a linear scaling of Plp with TV 2 ; 
whereas as Plp is close to saturation, there tends to be no variations with TV. 

5 Conclusion 

Experiments have shown that Life's transition from an “inactive-sparse phase” 
to a “labyrinth phase” (LP) is a continuous phase transition dependent on the 
synchrony rate a and whose critical value a c is controlled by the missing-link 
rate e~. As the topology was perturbed (i.e., when e~ increased), the inactive- 
sparse phase domain extends while the LP domain shrinks. The abrupt change 
in behavior according to the values of initial density d; n i was interpreted with the 
hypothesis that the settlement of LP results from the development of “germs” , 
i.e. small configurations that are able to “colonize” the whole lattice. The study 
of the evolution of potential germs from micro-configurations allowed us to start 
to understand the observations and to give some predictions on the probability 
to reach LP starting from a random configuration. One interesting question is 
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to examine whether these observations hold for a large class of CA or if they are 

somehow related to the computational universality of Life. 
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Abstract. A local information measure for a one-dimensional lattice 
system is introduced, and applied to describe the dynamics of one- 
dimensional cellular automata. 



1 Introduction 

Cellular automata (CA) are spatially extended dynamical systems. They have 
been widely used as models of spatial dynamical processes [1]. They have 
also been used as simple systems in which to study the phenomenon of self- 
organization, i.e., the ability of a system to build up structure from homogeneous 
initial conditions. 

In this paper we use concepts from information theory to study the dynamics 
of cellular automata. The field of information theory was founded by Shannon 
in his 1948 paper [2] . The Shannon entropy introduced in [2] is a global quantity 
that measures the average information content per symbol in symbol sequences 
generated from a given distribution, when all correlations are taken into account. 

Since the work of Boltzmann and Gibbs, entropy has been used in physics 
as a measure of the disorder in a system. The connections between physics and 
information theory are strong. For instance, in the framework of Jaynes [3], the 
entropy of a system with many degrees of freedom quantifies the uncertainty 
regarding the microstate of the system. A number of physicists, going back to 
Wheeler’s vision of ”It from Bit”, have argued that the concept of information 
should be used to describe fundamental physical processes (e.g., [4,5]). One area 
where information theory has been particularly important is in the thermody- 
namics of black holes (e.g., [6]). 

In numerical and mathematical studies of self-organizing spatially extended 
systems, the microstate of the system may often be known in great detail or ex- 
actly. In this case, it is useful to define local versions of entropy or information to 
characterize the structure that arises in the time evolution [7,8]. In [8], such local 
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a European Project in the EU FP6-IST-FET Complex Systems Initiative. 
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quantities were utilized to describe structure arising at different length scales in 
continuous spatiotemporal systems. Information flow in spatial dimensions and 
between length scales was also discussed. A number of other authors have also 
investigated information transport in spatially extended dynamical systems [9, 

10,11,12,13,14]. 

If the time evolution is reversible, microscopic entropy is a conserved quan- 
tity. This is the case both for classical systems viewed in terms of phase space 
distributions, and in quantum mechanics. An important question is whether we 
can introduce a local version of the microscopic entropy that is locally conserved. 
In other words, we would like to define a corresponding local information flow 
so that the system obeys a local continuity equation. Since the information or 
entropy depends on the degree of correlation in the system, it is not immediately 
obvious that this can be done. 

Several authors have however suggested that such an approach may be pos- 
sible. This was first proposed in the context of CA by Toffoli [10], who showed 
the local continuity of information flow in a restricted setting of perturbations 
around uncorrelated equilibria of particle conserving reversible CA, such as lat- 
tice gas automata. The continuity equation for information flow in ID CA of 
this paper, which takes correlations into account, was first formulated in [15]. 

The goal of this paper is to provide a mathematically rigourous foundation 
for these statements in the case of one-dimensional reversible and surjective CA 
rules. In Sect. 2 we introduce a local information measure for a ID lattice system 
and discuss its properties. In Sect. 3 we define the information flow and derive 
the local continuity equation. 



2 Local Information 

The Shannon entropy measures the average information gained by observing a 
new symbol from an infinite string being read in some direction, given knowledge 
of the past. The intent of introducing a local information quantity is to measure 
exactly how much information is contained in each symbol. However, the corre- 
lations in a symbol sequence can in general be arbitrarily long. Consequently, it 
is impossible for information to be completely localized. That is, the local infor- 
mation at a particular position i cannot always be correctly computed merely by 
looking at the configuration in a finite neighbourhood of i. The natural approach 
is therefore to define the local information as a limit which converges to a local 
analogue of the Shannon entropy as more and more distant neighbours are taken 
into account, and which can be computed locally when the correlations are finite 
in extent. The speed of convergence depends on the typical length of correlations 
in the system, which could be quantified as in [16]. 

In the following, A is a finite set and A z is the space of all bi-infinite se- 
quences of symbols from A. For a sequence x G A z , we use the notation x l A k 
for the length k + 1 block ( Xi,Xi + i , . . . ,a:,; + fc). A probability measure yi on A z 
is defined by a consistent set of block probabilities [i(ai . . . ai +n ) for all finite 
blocks a,; . . . a,_|_ n of symbols in A and positions i € Z. 
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We define the local information in the following way. Let /z be a probabil- 
ity measure on A z . Define the left local information at position i of x £ A z , 
conditioned on n past symbols, as 

<Sl (*;*) = • (!) 

Then define the left local information at position i by Sl(x; i) = linin^oo S£(x; i), 
and define the quantities Sr and Sr in the same way, but conditioned to the right 
instead, using Finally, define the local information S as |(Sl + Sr). 

The quantity S£(x; i) is the information gained from the symbol at position 
i when only knowledge of the n left symbols are assumed. If /z is Markov there 
is an n such that S£ = Sl- In other cases, correlations are not finite and no 
such n will exist. The information Si,(x; i) depends on the measure /z. However, 
in an ergodic case, the correct probabilities of finite blocks can be recovered 
with probability one from x by estimating the block frequencies. In a numerical 
simulation, this would normally be the case. 

A position i in a sequence has high local information if its symbol is un- 
expected given knowledge of already observed symbols. We can illustrate this 
using two examples of CA dynamics. For some CA, the time evolution can after 
an initial transient be described in terms of domains of some background pat- 
tern together with moving domain boundaries. The background pattern typically 
consists of blocks from a regular language which is invariant under the CA (e.g., 
[17]). The domain boundaries are local configurations which are often periodic 
and propagate with fixed velocity (often called particles), or move at random 
depending on the local context. They may be destroyed, transformed or created 
during collisions. In terms of local information, the background pattern gener- 
ated by the CA will have low information, common particles will haver higher 
information, rarer particles will have even higher and extremely rare events will 
have very high information. 

An example is elementary CA rule 110, using Wolfram’s rule numbering sys- 
tem [18], which generates a background pattern of large spatial and temporal 
periodicity and a large diversity of particles. A space-time diagram and a numer- 
ical estimate of the local information for rule 110 is shown in Fig. 1. As can be 
seen, local information acts as a filter for the space-time diagram and quantifies 
the structure that is built up. 

An example that shows a different type of spatial structure is elementary CA 
rule 18. Here, domain boundaries (kinks) perform random walks and annihilate 
pairwise upon collision [19,20]. The background pattern is in this case the regular 
language consisting of all blocks without consecutive l’s. The CA acts as the 
additive rule 90 on the background pattern. A space-time diagram for rule 18 
along with the corresponding local information is found in Fig. 1. Notice that 
although the background in this case has non-zero entropy it is still filtered out 
by the local information. For both rule 110 and rule 18, the probability measure 
at each time step was estimated from the block frequencies for blocks of length 
14 in a CA run with a sequence of length 10'. The initial measures were iid 
product measures with the probability of a 1 being respectively 0.5 and 0.2. 
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Space-time diagram for rule 18 




Local information. White = 0 bit and Black > 4 bit 




Fig. 1. Space-time diagrams and the corresponding local information S' 14 for elemen- 
tary CA rules 110 and 18. 



For the definition of local information to make sense, we need to check that 
it converges with n. The convergence of the local information S£ to Sl follows 
immediately from results by Khinclrin (Lemma 7.2 and Lemma 7.7) given in 
connection with a proof of the entropy theorem in [21]. When applied in our 
setting, these prove the existence of the local information in the following sense: 
Fix i £ Z. For all n, £ L 1 (ff), and — > Sl(-;*) almost everywhere 

and in the L 1 -norm. By symmetry, the same is true for Sr(-; z). 

Another requirement of a local information measure is that in the case of 
a translation invariant measure /z the average of the local information should 
equal the Shannon entropy h. Write H{X\Y) for the conditional entropy of the 
random variable X given Y. We can then show that E[S^\ = H(xo\xZ.\) and 
-E[Sl] = h. The same is true for Sr and S. 
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The first statement is proved by the calculation 

[ S£dfi,= -J2n(x°_ n ) flog ^:”^ =H(xq\xzI) • (2) 

x o \ Ml . x -n) ) 

The second statement then follows from the L l convergence of local information 
and the fact that H(x o\xZh) — > h [22, Clr. 4.2]. The equality for Sr and S follows 
since H(xo\xZ n ) = H{x o|a;"). 

3 Information Flow 

We now consider the time evolution of the local information introduced in the 
previous section. In this paper, we primarily discuss reversible CA rules (e.g., 
[23]), where the global CA mapping on bi-infinite sequences has a unique inverse. 
Since microscopic classical dynamics in physics is reversible, this is a very impor- 
tant class of CA for modelling physical phenomena. Some well-known physical 
CA models are lattice gas automata [24] aud CA for simulating spin systems such 
as the Q2R rule [25]. Simple nearest-neighbour reversible CA have in particular 
beeu studied by Takesue (e.g., [26]). The elementary reversible CA rule 26 in [26] 
can be considered as the simplest known system with plausible thermodynamic 
behaviour when one interprets its locally conserved quantity as energy. 

We also consider the more general class of surjective cellular automata, where 
the global mapping is onto. In this case, the number of preimages of any infinite 
sequence is bounded [27]. Since there are interesting examples of surjective rules 
among the elementary CA (both additive rules, and more complicated examples 
such as rule 30), but only trivial examples of reversible rules, simple illustrative 
examples of CA behaviour are often from this class. For surjective CA, and thus 
also for reversible CA, the global Shannon entropy (as well as all Renyi entropies) 
is preserved in the time evolution [28,29]. For non-surjective rules, where infinite 
sequences typically have an infinite number of preimages, the entropy decreases 
in time reflecting the shrinking phase space. 

Our aim is to show that the local information introduced above obeys a local 
continuity equation under the time evolution of a CA, or in other words, that 
there is an analogue of the continuity equations which apply to locally conserved 
physical quantities such as charge or particle density. We show how a flow Jl 
that satisfies a continuity equation can be constructed. We consider the case of 
a reversible CA F, but the construction can also be carried out for surjective 
CA (see [30] for details). 

Fix x € A z , and let Sl(M) = <Sl {F t {x)\i). We will show that there is a 
well-defined function Jh(t;i) = Jh(F t (x);i) such that 

A t S h + A: Jl = 0 . (3) 

Here, A is the forward difference operator, so that A t Sj, = Si,(t + 1\ i) — Si,(t; i) 
and AiJ\, = + — Jl(£; *)• With these definitions, Jl(£; i) is the informa- 

tion flow from position i — 1 to position i in iteration t + 1 of the CA. Since the 
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flow is associated with a change between two time steps (just like other currents 
in physics which involve a time derivative), it will naturally involve the state and 
measure of the CA at t + 1 as well as f , even though it can be expressed solely 
in terms of quantities defined at t , 

In the following, we consider a reversible CA F with left radius m, right 
radius r and local map / : A m+r+1 — >. _4. The inverse of a reversible CA is also 
a CA [31]. Let the inverse CA F of F have local map / with left radius M and 
right radius R. Fix x G A z , and let y = F(x). 

In general, the measure y changes in time due to the cellular automaton time 
evolution, unless we happen to start from an invariant measure for the CA. Let 
y° be a measure on and let /x* = -F*(/x°). The evolution of /A is given by the 
standard relation (e.g., [29]) 

n)= E (4) 

f-Hvi-J 



where / _1 (y-_ n ) is the set of all blocks x l A r n _ m that map to y\_ n under /. For 
simplicity of notation, we consider the information flow at spatial coordinate 0 in 
the first iteration of F, but this can trivially be changed to an arbitrary position 
i and an arbitrary time step t to t + 1. 

We also define the joint measure v of two consecutive time steps as the 
measure on ( A x A) z defined by 

v{x°_ n ,y°_ n ) = n°({z r _ n _ m G A m+n+r+1 \z°_ n = x°_ n and f{z r _ n _ m ) = /„}) • 

(5) 



It is easy to show that v actually is a measure, and furthermore that v is 
translation invariant if /i° is translation invariant. Note that by summing over 
all possible y_ n or x_ n we obtain from the definition u(x°_ n ) = y°(x_ n ) and 
v (y-n) = fJ'H.V- n)- 

From the definition of local information we have 

ASl(O) = lim (— log/x 1 (2/ 0 |2/Zi) + log/x°(a;o|x:^)) . (6) 

n . — ^-oo v / \ / 



Adding and subtracting the same terms, we can express this change as 



ASl(O) = lim log 

n— >oo \ 



v{xo,yo |*_i + M»J/-n) , v(x 0 ,yo\xJ n ,y_n +m ) 



^(yoly-n) 



-log- 



M°(a;o|x_„) 




= lim 

n— >oo 




y(x°_ n+M \ y°_ n ) 

v(xzi +M \y-n) 



log 



v{y- n+m \x°_ n ) 

n+m\'*'— n) 



+ 7n 



(7) 



where 



7 n = ~ log 



v(XQ,yo 

v{x 0 ,y 0 \xZ 



(8) 
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converges to zero almost everywhere as n — > oo. This can be proved using mar- 
tingale methods. See [30] for details. 

We can now write 



ASl(O) = lim (- log v(y°_ n+m \x°_ J + log v{y _\ +m \x _\) 

n—too 1 

+ logu(x 0 _ n+M \y°_ n ) -loguixZl+MlyZn)) ■ ( 9 ) 

Let us show that two of these terms have well-defined limits. The other two can 
be resolved in the same way. The first term can be written as 

- lim logu(y°_ n+m \ x -n) = - lim log y° (x\ G B\x°_ n ) (10) 

n — >oo n — >00 

where B C A r is the set of extensions to the right of x that are compatible 
with the image sequence y 0 .^- That is, B = {z[ G A r \f(x^_ m _ r z\) = y°_ r }- The 
right hand side of (10) converges because y°(x\\x°_ n ) converges for each block x\. 
This follows from the convergence of local information. If r > 0 there is at least 
one block for which this conditional probability is non-zero, otherwise x will have 
infinite local information at some position. If r = 0 we define y°(xi G B\x*L n ) 
to be 1. If we denote by Z 0 the set of all preimages zZ^ of y°_ x that satisfies 
2-00 = £- 00 , (10) becomes 



- lim log%_„ +m |a;-J = -log^/^la:^) 



( 11 ) 



Using the reversibility of the CA, we have analogously that 
lim logu(x 0 _ n+M \y 0 _ n ) = lim log y}(yf G C\y°_ n ) 



(12) 



where C C A R is the set of extensions of y®_ ^ to the right that are compatible 
with the preimage x i.^. By the same argument as above the right hand side 
converges. We can also express lim,,.-^ log v(x_ n+M \yZ n ) purely in terms of x 
and fi°. In fact, one can prove that 



lim log/fof €C\y°_ n ) 

n — >00 



log 



Zz 0 »°(* r -R + lls = £) 

Ez l l °( Z -R + 1 l x -oo) 



(13) 



where Z 0 is as above and Z is the set of all preimages of y°_ oa . Since F is 
reversible and R is the right radius of the inverse, all preimages agree on zZ^, 
but might be different to the right of coordinate —R. 

Equations (9), (11) and (13) now enable us to define the locally conserved 
information flow 



where 



Mi) = j+(i) - M(i) 



./{;(*) = - log 5> 0 (zj 

Jl (0 = - log 






) 

— OO/ 



Zo 



Ez 0 W 



0/_*+i — l|_i — .R— 1\ 



E zMzZr 1 ^- 1 ) 



(14) 

(15) 



(16) 
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Fig. 2. Left: The information J L (j) needed to determine y l _ J, which is not found in 
x l ~^ flows to the left across the boundary. Right: The information ( i ) in x^S^ which 

is not present in ylrj, leaks out to the right across the boundary. 



Here, Z are the preimages of y’i^, and Zq are those preimages that satisfies 
z l ~^ = x l Z^- Both J^(i) and J^{i) are non-negative. 

By using these definitions, (9) can be written as the continuity equation 

A t S L + A t J h = 0 (17) 

which is valid at every i and t and for all measures /i°, not necessarily translation 
invariant. 

The expressions J £" and J £ have natural information theoretic interpreta- 
tions. This is more readily apparent if the flow is written in terms of quantities 
defined both at t and t + 1: 

M 1 ) = J£ (1) - J£(l) = lim {-logv(x 0 _ n+M \y°_ n ) +\ogv{y 0 _ Jl+rn \x°_ n )) . 

n—t oo 1 

(18) 

The quantity — log v(y2. n + m \x?_ n ) is the additional information needed to 
uniquely specify y_ n+m when knowing x°_ n . The definitions above can be viewed 
as a choice of boundary conditions in the definition of the flow. In the case of 
AS'l(I), we only want the contribution from information flow across the bound- 
ary between 0 and 1 and not the boundary at —n. Due to the condition on the 
left boundary of the finite strings (knowing m more symbols of x than of y , see 
Fig. 2) the left part of y°_ n+rn is uniquely determined by x°_ n , and we avoid 
the unwanted effect of an information flow across the left border. Therefore, 
the remaining information needed to determine the right part of y°_ n+rn must 
come from the coordinates to the right of 0. Consequently, a flow J£" (1) from 
coordinate 1 to 0 is induced. The limit n — > oo must be taken to ensure that all 
information contained in x about its left prolongation is taken into account. 

The quantity — log v(x°_ n+M |j/° n ) is the additional information needed to 
uniquely specify x_ n+M given knowledge of y_ n . Once again the left boundary 
conditions ensure that the left part of x_ n+M is uniquely determined, see Fig. 2. 
However, not all information about the right part of x°_ n+M is found in y°_ n . This 
missing information leaks out to the coordinates to the right of 0 inducing the 
flow Jj)"(l). Technically, the boundary conditions are obtained by introducing a 
term which vanishes in the limit, see (7). 

By an argument similar to that used for reversible CA, the existence of 
a continuity equation can also be extended to a larger class of CA than the 
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reversible CA. The crux of the argument used to find a continuity equation 
for reversible CA is that all preimages of y°_ n agree on a central block. This is 
also the case for those surjective CA where almost all y £ have exactly one 
preimage. These are the CA with M(F) = 1 in the notation of Hedlund [27]. In 
this case J^(i) has the same expression, but in j£(i) R will be changed to the 
stopping time r, which is the smallest integer such that all preimages of y'L 00 
agree to the left of coordinate i — r. We will present the details of this and other 
cases as well as examples in [30]. 

4 Conclusions 

We have studied local information quantities 5 l and Sr. that measure the local 
information density of a one-dimensional lattice system. These are useful for 
detecting and quantifying structure in a lattice configuration. 

An information flow Jl has been introduced such that a continuity equation 
A t S l + = 0 holds under iteration of a one-dimensional reversible CA. 

We have discussed how the CA generates two different types j£ and J£) of 
information flow satisfying Jl = — </£" . Moreover, we have sketched how an 

expression for Jl may be found in other cases by choosing boundary conditions 
in a suitable way. 

The continuity equation is a fundamental property of information transport 
in reversible systems. But we expect local information in cellular automata to 
have further interesting properties. In particular, a continuity equation in physics 
can be viewed as a constraint, rather than an equation that determines the 
dynamics of the system. In a similar way, one may expect information flow to 
have different dynamic characteristics in different CA. We have seen examples of 
CA building up high information in local configurations. Other CA might show 
diffusive behaviour and smear out initial inhomogeneities in local information. 
Such properties would give much information about the dynamical behaviour of 
the CA. 
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Abstract. A model performing mesh partitioning into computationally 
equivalent mesh parts on regular square lattices using a diffusion con- 
trolled cellular automaton (DCCA) is proposed and studied in this ar- 
ticle. Every processor has assigned a domain seed at the beginning of 
a simulation. Algorithm works with growth of seeds and migration of 
domain borders, the later is triggered when the difference of diffusive 
agents on both sides of a border exceeds a given threshold. The model is 
built using self-organization principles ensuring convergence. Solutions 
are dynamically stable configurations achieved from any initial configu- 
ration. 



1 Introduction 

Mathematical formulation of physical, engineering and other scientific problems 
often leads to a model expressed by a set of Partial Differential Equations (PDE) . 
Numerical formulation of a set of PDE is typically done by the Finite Element 
Method (FEM) . This technique splits a continuous problem - typically without 
any analytic solution - into a finite number of discrete problems called the fi- 
nite elements. The whole problem is then solved by this discrete solution above 
the whole set of finite elements. Fine FE meshes approximates the continuous 
problem more precisely then coarse meshes. Therefore, the data storage require- 
ment and the computational cost rapidly becomes a ’bottleneck’ of a computer 
simulation. This naturally leads to use of parallel computers to solve extra large 
FE meshes. One of the most used techniques - splits a mesh into sub-domains 
where each sub-domain is assigned to different processor - uses large distributed 
memories, and message passing for inter-processor communication. Among all 
parallel algorithms dealing with this problem, we could distinguish three main 
groups of methods: the parallel direct methods, the parallel iterative methods 
and the domain decomposition methods. 

Most of the research had been done on development of mesh partitioning 
algorithms based on a load balancing approach, see for example the papers [1, 
2,3] and references therein. Algorithms with special interest to the aspect ratio 
of the sub-domains were investigated later on. The most efficient mesh parti- 
tioning algorithms are now implemented in program libraries, like [4, 5, 6, 7, 8, 9]. 
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The partitioning problem is known to be NP-complete in general and there is 
no single algorithm that is always capable of producing the best solution. Many 
algorithms are based on heuristics of varying degrees of efficiency, which may 
fail because of special properties of the partitioning problem being considered. 
Most of these algorithms are based on graph theory because the topology of the 
finite element mesh could be represented as a graph. 

Cellular automata (CA) are often represented as discrete dynamical systems 
where space, time and variables are discrete. CA-model discretize space into a 
given number of dimensions - in our case into two-dimensional (2D) lattice of 
squares. The elements of this lattice are called cells. A neighbourhood is defined 
to every cell - a list of the nearest neighbouring cells - that is uniform through 
the whole lattice. Every cell contains a list of variables, e.g. domain number, 
amount of a diffusive agent, etc. The evolution of the system is driven by a 
transition rule that computes new values of variables of an updated cell using 
old values of variables of cells laying in the neighbourhood and values of the 
cell itself taken at the previous CA-step. Comprehensive information about the 
CA can be found in [10,11,12,13], at the web page of the author and at links 
provided there. A good introduction into CA with examples is provided at web 
page [14], 

We propose a novel diffusion controlled cellular automaton (DCCA) model 
performing domain decomposition on square lattices - restricted to square lat- 
tices - into N computationally equal parts. Algorithm works with growth of 
domains from domain seeds using migration of domain borders, the later is trig- 
gered when the difference of the diffusive agent on both sides of a boundary 
exceeds a given threshold. The model is built using self-organization principles 
ensuring convergence. This model performs load balancing only. CA-models of 
dynamic recrystallization [15] and living cells [16] motivate growth of seeds, and 
models of dynamic recrystallization and grain boundary migration [17,18] moti- 
vate migration of domain boundaries. 



2 Cellular Automaton Model 

Explanation of the model is done in three subsequent parts in order to make the 
definition of this model easier to follow. In the first one, the general concept of 
diffusion controlled cellular automata (DCCA) is proposed. In the second one, 
all variables used in the proposed model are defined. Finally, all five sub-steps 
of the DCCA-model are defined. 



General Concept of Diffusion Controlled CA. A diffusion controlled cel- 
lular automaton is built upon several sub-steps: growth of domains from domain 
seeds, diffusion of a diffusive agent, and migration of domain boundaries. All 
starts in domain seeds; the rest of cells is empty. Their number defines the num- 
ber of domains, which are finally assigned to distinct processors. Every domain 
seed initially occupy space of just one cell where the initial amount of the diffu- 
sive agent - conserved during the whole simulation - is stored. 
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Fig. 1 . A figure displaying dispersion of the growth-factor into three directions distr[i] 
for the middle cell under consideration within the first diffusion step 

Growth of a domain is allowed only if the difference of the diffusive agent 
between growing domain and empty cells exceeds a given threshold. If domain 
borders meet then growth is terminated and the migration starts to operate. 
When the difference of diffusive agent on one side exceeds a prescribed thresh- 
old, a migration event occurs, i.e. domain boundary of domain having a higher 
content of the diffusive agent migrates into domain having a lower content. 



Definition of Variables. The algorithm is relatively complicated because one 
complete step is composed of five subsequent sub-steps: growth, migrationi , 
dif fusioni, dif fusion^, migration where the lower index tells that given pro- 
cess is split into two separate parts. One cycle is accomplished only when all five 
sub-steps in the above defined order is accomplished. 

The following variables are used: (a) domain-number assigns an unique num- 
ber to all cells from a given domain or to wall; (b) growth-factor stores the amount 
of growth factor; (c) ditribution[ 0], ditribution[ 1], ditribution[2 ], and ditribution[ 3] 
are temporary variables used to store the fractions of growth-factor split in four 
directions: they are defined as relative translations [-1,0], [0,-1], [1,0], and [0,1]; 
and (d) protect protects a given cell - assumed to migrate - from being migrated 
by other neighbouring cell. A cell under consideration means the cell with the 
relative coordinates [0,0]. 

All variables belonging to every cell are initialized to zero values except seeds. 
Every simulation starts in a number of cells - called domain seeds - that is equal 
to the number of required domains. The amount of growth-factor is strictly 
conserved in every domain. 



Rules Defining DCCA. All five rules defining the DCCA algorithm are 
explained in this subsection. 
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Growth rule G performs growth of domain seeds until all space that could 
be occupied is consumed: 

(a) All cells from the von Neumann neighbourhood (W, S, E, N) of the cell under 
consideration are tested against possibility to grow into the place occupied by 
it. Growth is not allowed into cells marked as the wall cells. 

(b) A neighbouring cell with the greatest content of growth-factor will be allowed 
to grow into the position of cell under the consideration. When two or more 
domains meet, the domain having greatest potential to grow wins. 

This sub-step G is checked during every cycle even in the case when no 
empty cell is available to be occupied by growth of a domain. Cells initialized as 
a wall enables us to insert irregular boundary and holes into the simulated lattice. 

Migration rule Ml tests five conditions for a cell under consideration. 
Variable protect could be set to a value that lock the potential migration sub-step 
against being influenced by other cells until the migration sub-step is finished. 
A migration threshold is defined. It ensures that migration operates above some 
difference of diffusive agents of neighbouring domains: 

(0) Is the difference of the growth-factor of both cells above the threshold? 

(a) If domain-numbers of cells [0,0] and [-1,0] are different => set protect = 10. 

(b) If domain-numbers of cells [0,0] and [0,-1] are different => set protect = 11. 

(c) If domain-numbers of cells [0,0] and [1,0] are different => set protect = 12. 

(d) If domain-numbers of cells [0,0] and [0,1] are different => set protect = 13. 
Condition (0) has to be fulfilled for all cases. Conditions (a), (b), (c), and (d) 
are exclusive. 

Implementation of the sub-step Ml before diffusion sub-steps D1 and 
D2 and the sub-step M2 is fundamental. It is necessary to protect cells that 
are going to be migrated from diffusion of growth-factor inside them. The 
growth-factor belonging to any domain has to be conserved. 

Diffusion rule D1 is implemented by use of a modified finite difference 
scheme of the Laplace operator. One physical diffusion step is split into two 
distinct sub-steps. In the first one Dl, the growth-factor is deposited into four 
directions. In the second one D2, deposited growth-factor is shifted into appro- 
priate neighbouring cells and then one physical diffusion step is finished: 

(a) Number of neighbouring cells belonging to the given domain within von Neu- 
mann neighbourhood is counted (maximum of four). 

(b) The amount of growth-factor is equally divided between all neighbouring cells 
within the von Neumann neighbourhood and the cell itself that belongs to the 
same domain. Remaining amount of the growth-factor after this division stays 
in the cell under consideration. Therefore, the growth-factor is conserved! 

(c) Growth-factor belonging to a neighbouring cell is dispersed into the variable 
distr[i\ that are associated with direction i pointing to this cell. 

A schematic figure displaying preparation of dispersion of the growth-factor 
into three directions distr[i] for a cell under consideration can be seen in 
Fig. 1. Data are stored in the cell that disperses the growth-factor. Please 
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Fig. 2. The resulting time evolution of domain sizes Ai for four domain seeds located 
in the bottom-left corner where domain sizes belonging to top-right and bottom-left 
seeds are marked 



note that a cell that does not belong to given domain is located in the 
south. Therefore, the growth-factor cannot be dispersed into this direction 
denoted by the number equal to 1. Hence, the total amount of diffusive agent is 
divided into four equal parts - three neighbours and the cell under consideration. 

Diffusion rule D2 represents the second finalizing part from two subse- 
quent diffusive steps. Diffusive agent is migrated into destination cells selected 
in diffusive sub-step Dl: 

(a) The amount of growth-factor deposited in temporary variables distr[i ] of 
neighbouring cells - i.e. in directions pointing to this cell belonging to the 
same domain are added into the growth-factor variable. 

Remark: When we collect the growth-factor from east then we take the growth- 
factor from variable distr[( 0 + 2)%4]. The symbol % denotes modulo operation. 

Migration rule M2 is the final sub-step of the whole cycle. It is the second 
part of migration, migration 2. Protected movements of domain border into the 
cell [0,0] are realized in this sub-step: 

(a) If protect = 10 then the domain from direction [-1,0] is migrated into. 

(b) If protect = 11 then the domain from direction [0,-1] is migrated into. 

(c) If protect = 12 then the domain from direction [1,0] is migrated into. 

(d) If protect = 13 then the domain from direction [0,1] is migrated into. 

(e) The variable protect is set to zero. 

Those new potential reorientations in the second migration sub-step M2 
leading to creation of disconnected small domains are rejected. The whole simu- 
lation cycle is accomplished by the second migration sub-step. Only output from 
this sub-step can be used as the numerically correct one. 





136 



J. Kroc 



3 Results and Discussion 

The properties of the CA-model are studied on several examples working with 
four and eight domain seeds and a change of the domain shape. Every sub-step 
of the full simulation step requires different amount of time to be completed. 
Therefore, optimization of the proposed algorithm should be primarily focused 
into the most computationally expensive sub-steps of the algorithm. The algo- 
rithm of growth sub-step G is based on diffusion of the growth-factor - let us 
call it diffusive-growth - that enables us to control achievement of approximately 
globular shape of domains. 





Fig. 3. Snapshots of the evolution of domain shapes taken at six different times in the 
case of eight domain seeds. The simulation starts at the top-left corner and proceeds 
in rows (times 0, lk, 10k, 20k, 60k, and 150k) 



Let us assume for a moment that growth is without restriction of diffusion 
of the diffusive agent - i.e. the pure growth. The good message is that the pure 
growth without diffusion - i.e. non-diffusive-growth - is roughly ten times faster 
compared to the growth with presence of the growth-factor. The bad message 
about the non-diffusive-growth is that it produces rather unrealistic initial do- 
main decompositions - with elongated and narrow domain shapes - requiring 
much longer migration time. The migration sub-step M is much computationally 
expensive than the growth one G. Therefore, the total simulation time is more 
longer for the non-diffusive-growth then for diffusive-growth. 
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There is shown evolution of initial configurations having four domain seeds 
and two different initial configurations with eight domain seeds. The initial stage 
of evolution of domains of four domain seeds is same as in Fig. 3 with four seeds 
located at bottom-left corner before impingement them with other seeds. Those 
seeds located in the bottom-left corner are initial ones, the initial configuration 
is far away from the ideal one. In other words, one domain - the top-right - 
is growing too fast and growth of the others is suppressed by growth of this 
dominant domain. Therefore, simulation spends a long time in migration sub- 
step M and the total simulation time becomes too long. The resulting area Ai 
evolution with respect to time of all four domain sizes can be seen in Fig. 2 where 
the top curve belongs to the domain in the top-right corner. The curve having 
the smallest slope belongs to the domain located in the bottom-left corner, see 
Fig. 3. More ideal positions of domain seeds diminish the simulation time. 




time 



Fig. 4. The resulting time evolution of domain sizes Ai for eight domain seeds located 
in the area without a hole - for the topology in Fig. 3 

A simulation with eight domain seeds is displayed in Figs. 3 and 4 where 
snapshots of the evolution of domain shapes are provided in Fig. 3 and relevant 
evolutions of the domain sizes in Fig. 4. Not surprisingly, the use of a higher 
number of seeds for the same total size of the simulated lattice diminishes the 
total simulation time. It is because domains become smaller. Distances that 
have to be migrated by domain borders are shorter. The final shapes of all 
domains except one located in the up-left corner are convex. Non-convex shapes 
of domains automatically mean longer domain borders. 

The initial positions of domain seeds are not distributed homogeneously. Four 
seeds are located closely in the bottom-left corner and the other four seeds are 
spread across the rest of the lattice. This initial configuration leads to a distorted 
growth of seeds. Some are suppressed to grow by others that are growing too 
fast. 

A simulation with eight domain seeds and a hole located in the middle of the 
lower part of the lattice is displayed in Figs. 5 and 6. Diffusive agent serves as a 
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Fig. 5. The evolution of domain shape at six different times in the case of eight domain 
seeds located in the very same positions as in Fig. 3 and with a hole in the middle of 
the lower part of the lattice. The simulation starts at the top-left corner and proceeds 
in rows (times lk, 2k, 20k, 50k, 80k, and 150k) 



compactor of domains, i.e. it helps to keep them together. Nevertheless, presence 
of holes brings anisotropy into the lattice and slows down the overall diffusivity 
within the lattice. Hence, the total simulation time is substantially longer than 
in the case without presence of the hole. 

The simulation starts in the similar way as in the case of eight domain seeds 
without a hole. Growth of domain seeds quickly fills the whole lattice. The prob- 
lem is that those four domain seeds located in the bottom-left corner are allowed 
to grow and migrate only upwards contrary to the previous case. Therefore, their 
movement is substantially slowed down. Induced anisotropy of the lattice leads 
to production of non-convex domains. One of domains - the second one from 
the up-riglrt corner - has a quite large curvature. 

The CA-model is constructed in such way that domains finally occupy the 
approximately equal volume, and therefore, processors have the same computa- 
tional load. This is done by use of self-organization principles in the CA-model. 
It is easy to see from results that position of the domain seeds at the very begin- 
ning of a simulation strongly affects the total simulation time - by at least one 
order of magnitude - and the final shape of domains. The CA-model proposed in 
this contribution is formulated for the case of regular square lattices having four 
neighbours. Generalizations to trigonal lattices having three neighbours and to 
non-periodic grids is the subject of the future work. 
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Fig. 6. The time evolution of domain sizes for eight domain seeds located in the area 
with the hole - for the very same topology as in Fig. 5. Compare it to the case having 
the very same location of domain seeds but without the hole, Figs. 3 and 4 



It is expected that domain decomposition of complicated topologies using 
DCCA in 3D is too hard for this model. There is development of more sophisti- 
cated models, which could be sufficient to solve any problem in 3D. This model 
should be understood as an introduction into the whole class of new models. 

4 Conclusions 

A novel diffusion controlled cellular automaton (DCCA) model performing do- 
main decomposition on regular square lattices with special attention to load 
balancing is proposed and studied in this work. It is recognized that conver- 
gence speed is the highest in the case when the positions of domain seeds are 
estimated closely to their ideal positions. It was found that presence of holes 
within the lattice leads in the case of bad positions of the initial seeds to a 
dramatic increase of the total simulation time. 
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Abstract. The cellular automata discrete dynamical system is consid- 
ered as the two-stage process: the majority rule for the change in the 
automata state and the rule for the change in topological relations be- 
tween automata. The influence of changing topology to the cooperative 
phenomena, namely zero-temperature ferromagnetic phase transition, is 
observed. 



1 Introduction 

Lattice models are the basic instrument in the study of phase transitions in 
equilibrium statistical mechanics (see, e.g., [1]). The Ising model is the sim- 
plest model of nearest-neighbor ferromagnetic interactions where the collective 
features are studied. Traditionally, the phase transition has been considered in 
regular lattices. According to the renormalization group theory, detailed struc- 
ture of interactions as well as the structure of network connections are irrelevant. 
Therefore, for example, cellular automata have been continuously tested with the 
hope that these systems can imitate nature [2]. 

If the cellular automata are considered on a regular lattice and each site i 
of the lattice is occupied by a binary variable, .s,, which assumes the values ±1, 
then referring to these variables as spins which point either ‘up’ (+1) or ‘down’ 
(— 1), we employ the terminology of the Ising model. The Ising nearest-neighbor 
interaction in case of cellular automata becomes usually a simple majority rule, 
it is: if most of the nearest neighbors of i spin point up (down) at time t then 
the value of the i spin at time t + 1 is up (down). This deterministic rule is 
interpreted as the zero-temperature approximation to the Ising interactions. The 
research here is aimed on the ergodicity, i.e., on the uniqueness of the final state. 
Especially, one asks if starting from a random distribution of spins states one 
ends up with a homogeneous states of all spins up or all spins down. To mimic the 
temperature effects one adds a stochastic noise to perturb with some probability 
the execution of the deterministic dynamics. The research here is focused on 
properties of the transition from the non-ordered phase to the ordered phase - 
ferromagnetic phase transition. 

Recent research in the structure and topology of real networks [3,4] has shown 
that social, biological, technological networks are far from being regular. How- 
ever, they also are being far from a random network. It has been shown that 
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networks called small-world networks and scale-free networks exhibit mixed prop- 
erties of regular lattices and a random graph [3,5]. This has trigged to study 
of standard models of statistical mechanics in these networks. The transition 
from non-ordered phase to the ordered phase in the spin system spanned on 
the small-world network shows a change in behavior from the regular case to 
the mean-field characteristics [7,12]. Interesting that, depending on the network 
construction, the phase transition at the finite temperature exists or not [6]. 
Moreover, when the scale- free network is employed then the critical properties 
of the Ising model are different from those ones observed in the regular networks 
[13,14]. Many authors have considered other problems in these new networks: 
percolation properties [8], the spread of infection diseases [9] social relations [10], 
computer organization [11]. 

Small-world networks are intermediates between regular lattices and the ran- 
dom graph. A small-world network is generated by rewiring with a probability p 
the links of a regular lattice by long-distance random links [5] . The presence of 
a small fraction of ‘short-cuts’ connecting otherwise distant points, drastically 
reduces the average shortest distance between any pair of nodes in network. 

In case of cellular automata such the small-world network means that the 
cellular automata is no longer homogeneous — inlromogeneity is due to the 
topology. Sets of nearest-neiglrbors differ from a site to a site. 

There is one important property which is ubiquitous in biological and artifi- 
cial networks and which is missed in the small-world network the distribution 
of the vertex degree does not show wings decaying as a power-law. Two important 
ingredients have been shown to be sufficient to generate such feature: growing 
number of vertices and preferential attachment of links. The well established 
Barabasi- Albert model based on these two mechanisms has been proposed [3]. 
The distributions of the vertex degree in this network is of the form P(k) oc fc -7 
where 7 £ [2,3] and k a vertex degree. Thus adapting the Barabasi-Albert idea 
to cellular automata one obtains a computing system with the topology that 
evolves. 

In this paper we address the question of the role played by the topology 
in the zero-temperature ferromagnetic transition considered in the cellular au- 
tomata of spins. The starting system is the cellular automata on a square lattice. 
The square lattice used here is strongly clustered in the sense that if j and k 
are neighbors of i then there is a short path between them that does not pass 
through i. In order to keep the high clustered property (typical for the small- 
world network) and reproduce the distribution of the vertex degree with the 
power-low decay (sign of the scale-free network) we propose the evolution rule 
for the network. Hence, the cellular automata time step means the change of the 
spin states and the change of the network topology. It will be shown that with 
increasing p- the rewiring parameter, the cellular automata work as the solver 
to the density classification task [15], highly accurately, namely our cellular au- 
tomata converge to a fixed point of all l’s if the initial configuration contains 
more l’s than -l’s by 2%, and, symmetrically, to a fixed point of all -l’s if the 
initial configuration contains more -l’s than l’s by 2%. 
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2 Model Description 

We present the model of the spin cellular automata located in N vertices among 
which there is an explicit integer parameter Nq fixing the total number of edges. 
These N spins are located to form a square lattice with the periodic boundary 
conditions. Hence, if the linear size of the lattice is L then N = L x L and 
N 0 = 4 N. We start with the ordinary nearest-neighbor relations, i.e., for any i 
site the nearest-neighbor spins are located in the following set of vertex indices: 
N(i, 0) = {i — L, i — 1, i, i + 1, i + L}. Initially, each spin state is set up randomly 
with the probability p. 

The evolution step consists of the two subsequent steps: (A) the asynchronous 
stochastic evolution of topological relations and (B) the synchronous determin- 
istic majority rule. 

2.1 Evolution of the Lattice 

In each time-step t and for each vertex i the following rearrangement of the edges 
is performed: from the set of neighbors N(i, t) of the vertex i (at the given time 
t) one element j is chosen at random with the probability p. Thus p relates to 
the parameter of the small- world evolution [5]. Then, another vertex k is picked 
up from the set of all vertices N — {?} randomly. A new edge is created between i 
and k independently of the fact that there was or not an edge between them. The 
edge between i and j is deleted. Hence the total number of edges N 0 is conserved. 
This evolution we will call stochastic on the contrary to the evolution in which 
some preferences are applied. The proposed preferences are in the agreement 
with the basic conviction how to obtain a scale free distribution for the vertex 
degree: ‘the richer you are, the richer you get 1 — principle [3]. 

The preferences are as follows: in each time step t and for each vertex i, from 
the set of neighbors N(i,t) an element j is chosen with the probability modified 
by the degree of j vertex. Namely, the probability that j neighbor is unlinked is 

a) 

where deg(j) denotes the degree of j vertex, T denotes the value of the unlink 
threshold. The degree of a vertex denotes the number of edges attached to the 
vertex. By (1) if deg(j) > T then it is impossible to unlink j. This process will 
be called the intensional detachment. The edge between i and j is deleted. Then 
a new vertex is randomly chosen but with the preference to link to a vertex with 
the high vertex degree. It is, a randomly picked up vertex k is accepted with 
probability 

deg(k) 



Thus if deg(k) > T then randomly chosen k is certainly accepted. This process 
will be called the preferential attachment. 

In the following we will name: 
model 0 : the completely stochastic edge evolution, 
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model 1 : the intensional detachment and the stochastic attachment, 
model 2 : the stochastic detachment and the preferential attachment, 
model 3 : the intensional detachment and the preferential attachment. 

2.2 Evolution of Spins 

Synchronously, in each time step, the majority rule is applied to set the state of 
every spin in the next time-step. The rule acts on all spins belonging to the the 
set of nearest neighbors N(i,t) of the given itli spin. In the case when the result 
of voting is zero, the itli spin does not change the state. 

3 Results 

The system simulated is a square lattice with L = 200 and periodic boundary 
conditions. We start with a random distribution of spin states with probability 
p to set the spin state to +1. In each time step we measure magnetization - the 
sum of all spins states normalized by L 2 . If the magnetization is 1 or the time 
step reaches the limit of the hundredth step, then we stop the evolution and 
record the following characteristics: magnetization, susceptibility (the variation 
of the magnetization), vertex degree and number of time steps to reach the 
stable state. The experiment is repeated 200 times for each p and p value. The 
threshold value for the preferences, see (1), (2), is T = 8 in all experiments. 

3.1 Final State Characteristic 

The sharp change in properties of the final state of cellular automata is ob- 
served when p is crossing 0.5, see Fig.l, Fig. 2. The final state switches between 
the state of all spins down if p < 0.5 (magnetization = —1) to the state of 
all spins up if p > 0.5 (magnetization = 1), Fig.l. Depending on the model of 
the network evolution this transformation goes fast: model 0, or less rapidly: 
model 3. The transformation can be estimated by measuring the intervals of p 
around the 0.5- value which lead to the final states with with magnetization 
close to ±1, see Fig.l. More accurate marks for these intervals can be found by 
observing susceptibility, see Fig. 2. In case of unperturbed network (p = 0) the 
susceptibility takes the parabola-like shape on the log-plots. With increasing p 
the suscpetibility becomes the sharply pointed curve with maximum at p = 0.5. 
The basic step in p in the experiments performed is Ap = 0.02. Therefore the 
peaks of susceptibilities are recognized within the interval (0.48,0.52). In the 
small windows of Fig. 2 one can observe how fast and which way the parabola 
transforms into the sharply pointed curves. The significant qualitative change 
occurs when p changes from p = 0.01 to p = 0.02. In particular, if one assumes 
that the interval of critical changes means the susceptibility greater than 1 (at 
average this condition is equivalent to the demand of the absolute magnetization 
being smaller than 0.9), then one finds: 

p £ (0.30,0.70) in case of cellular automata without the network evolution 
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initial probability for a spin to be UP initial probability for a spin to be UP 



Fig. 1. Magnetization of final configurations vs initial probability for a spin to be UP 
( p ) for different values of short-cut edges (p) what is indicated by curve’s labels: (A) 
model 0 - stochastic linking and unlinking, (B) model 1 - intentional unlinking, (C) 
model 2 - preferential linking to, (D) model 3 - intentional unlinking and preferential 
linking to. The large frames present the general dependences on p and p; the inside 
frames show the change of magnetization if p is close to 0.5 and p is small. The lines 
without point markers are results for p = 0.001, 0.002, . . . , 0.01 subsequently. 



p £ (0.48, 0.52) in case of cellular automata with the network evolving ac- 
cording to the rules of models 0,1,2 

p £ (0.46, 0.54) in case of cellular automata with the network evolving ac- 
cording to the rule of model 3. 

Because of the observed rapid changes in main characteristics one can say that 
the cellular automata solve the density classification task. Especially, model 0 
solves the density classification problem extremely efficiently: quickly and with 
the high certainty. 

Also time needed to reach the fixed point stabilization changes when p is 
increasing. If p > 0.1 and p << 0.5 then the time to stabilization is larger than 
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Fig. 2. Log plots of susceptibility of final configurations vs initial probability for a 
spin to be UP (p) for different values of short-cut edges ( p ) what is indicated by 
curve’s labels: (A) model 0 - stochastic linking and unlinking, (B) model 1 - intentional 
unlinking, (C) model 2 - preferential linking to, (D) model 3 - intentional unlinking 
and preferential linking to. The large frames present the general dependences on p and 
p; the inside frames show the change of susceptibility if p is close to 0.5 and p is small. 
The lines without point markers are results for p = 0.001, 0.002, . . . , 0.01 subsequently. 



in case of the system with not evolving network. However, the stabilization is 
reached in less than 40 steps. If p € (0.4, 0.6) and p is small, p < 0.01, then the 
fixed point stabilization is not observed in less than 100 steps. The properties 
shown in figures collect features found as snap-shots of cellular automata states 
at the one hundredth step. This limitation does not introduce any important 
restriction. It is because that when the network does not evolve then, due to 
the many invariants of the spin-state rule, one observes the stabilizations on 
configurations which oscillate with some time period. This results in that the 
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stabilization time is read as t = 100. Such limit configurations occur also if p 
is small, i.e. , p < 0.01. When the rewiring process is stronger the oscillating 
patterns disappear in less than 100 steps. The stabilization different from the 
fixed points is not observed if p > 0.01. 

3.2 Degree Distribution 

The vertex degree distributions are presented in Fig. 3. The plots collect results 
recorded when the cellular automata systems arrive to the limit of either a ferro 
fixed point stabilization or a hundred step of time. Hence the total number of 
changes in the network depends on the time given to the evolution. However, 
if p > 0.01 then after a hundredth steps the average probability for an edge to 
change is 1. Thus, the difference between a network state after a hundred steps 
with p = 0.01 and after the first step with p = 1 consists in the synchronousness 
of events: with increasing p the edge evolution becomes more synchronous. 

All plots of the vertex degree distribution with p < 0.01 characterize the 
network configurations that are obtained at the one hundredth step. In case of 
model 0 the fast convergence to the Gaussian distribution (the parabola shape in 
the log-plot) centered at k = 4 is observed. The significant distinction from the 
Gaussian shape in cases of other models is due to the preferences introduced. 
One observes vertices with the degree higher than 20 which in the Gaussian 
world do not exist. Because of the threshold value considered here, T = 8, the 
vertices with the degree larger than 8 are protected from unlinking and preferred 
in linking. Therefore the number of vertices with high degree is growing with 
increasing p. However, the presented distributions do not have wings depending 
polynomially on the vertex degree, namely, decaying as fc _T for some 7 and k 
vertex degree. 

4 Concluding Remarks 

We have considered properties of spin cellular automata initially spanned on 
the regular square lattice and then rewired each time step systematically. What 
we have observed is the final spin state. We have concentrated on the two con- 
figurations: all spins up and all spins down. It has appeared that only these 
two configurations are possible as the final configuration for spins. Which one of 
these two emerges is depended on p the density of 1 in the initial configuration. 
Due to the changes in the network the interval of uncertainty about the final 
configuration contracts sharply to the small interval around the critical value of 
p = 0.5. From the statistical physics point of view main investigations concern 
the ergodicity problem, i.e. the memory about the initial state properties in the 
evolving systems. The strong connectivity arisen from the small- world network 
enhances sharply the solution. The cellular automata considered are ergodic out- 
side the interval p £ (0.48, 0.52) what means that the interval of ergodicity is 
definitely larger than the interval of p in case of the ordinary spin cellular au- 
tomata. Therefore we claim that the systems considered can work as solvers of 
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vertex degree 



vertex degree 



Fig. 3. Log plots of vertex degree in final configurations vs initial probability for a 
spin to be UP (p) for different values of short-cut edges ( p ) what is indicated by 
curve’s labels: (A) model 0 - stochastic linking and unlinking, (B) model 1 - intentional 
unlinking, (C) model 2 - preferential linking to, (D) model 3 - intentional unlinking 
and preferential linking to. The gray lines are results for p = 0.001,0.002, . . . ,0.009 
subsequently; the dotted lines are results for p = 0.2, 0.3, . . . , 0.9 subsequently. 



the density task. Moreover, since the analysed properties have been collected at 
time moments much smaller than the system size, the stated features seem to 
be universal in the sense that they are size independent. 

Our work is only preliminary. The dynamical system considered here needs 
further investigations. We have studied the zero-temperature Ising problem. Now 
the study of stochastic spin evolution should be undertaken. Moreover, the net- 
works considered here can bee included into the one thermodynamic ensemble, 
so-called canonical ensemble, because all networks have the same number of 
edges [16]. For the canonical ensemble there is a possibility to fix a temperature 
and associate the energy to every network configuration. Such approach states 
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the new question: how this temperature and this energy relate to the ordinary 
thermodynamic characteristics of spin system. 
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Abstract. This paper discuss a class of 2-dimensional asynchronous cel- 
lular automata with conservation of mass, for the formation of patterns 
in groups. The previous study [14] reported a methodology of searching, 
automatically, for pattern-forming cellular automata using a genetic al- 
gorithm; this approach successfully found a few types of pattern-forming 
rules. The current study is a series of statistical analyses of one of the 
classes found by the above methodology, with the hope of understanding 
the mechanisms of the pattern formation. These analyses lead to some 
basic logic necessary to the pattern formation, but not to enough infor- 
mation to elucidate the whole mechanism of the pattern formation. This 
result suggests that the existence of unidentified cooperative operations 
between the different transitions of the cellular automaton rule to carry 
out the pattern formation. 



1 Introduction 

Nature produces many kinds of patterns, and the mechanism of such pattern 
formations are often unknown. Cellular automaton (CA) is a mathematical tool 
suitable for such studies and has been exploited intensively by many researchers. 
Because this mathematical tool is especially suitable for visual analyses of the 
results of calculation, the self-organization of spatial patterns is a very common 
topic among studies. For instance, the hodgepodge machine [1], and the cyclic cel- 
lular automata[2] give intricate patterns and are reminiscent of real phenomena 
of some chemical reactions. The formation of crystal-like pattern is also studied 
by a class of CAs [13]. 

In common CAs, all of the cell states are updated synchronously, and the 
patterns formed by such CAs are usually heavily dependent on the synchronous 
timing of the events occurring at each cell. This means that such pattern for- 
mations are sensitive to a tiny delay of the events at each cell. Such perturba- 
tions always exist in the natural environment, yet self-organization observed in 
real physical, chemical, or biological systems is robust and not dependent upon 
synchronous timing. Several scientists have already started questioning the as- 
sumption of the synchronous adiustment of random boolean networks[7,3,4l and 
CAs [8,12,11,16, 14] . 



P.M.A. Sloot, B. Chopard, and A.G. Hoekstra (Eds.): ACRI 2004, LNCS 3305, pp. 151—160, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 
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Fig. 1 . The three kinds of patterns found by using genetic algorithms: (a) checkerboard, 
(b)stripe, and (c)sand-likc. 



Applications of artificial dynamical systems, such as random boolean net- 
works or CAs, to the study of spatial pattern formations are expected to re- 
veal the mechanism of advanced self-organizations frequently seen in biochem- 
ical systems and sometimes even in solid-state physical systems such as self- 
protection, self-replication, self-maintenance, etc. However, the discrepancies, 
discussed above, between artificial and natural systems may spoil the fruit 
yielded by such studies. 

In this paper, we will consider asynchronous CAs leading to a certain spatial 
order. As an example we will discuss pattern formations in 2-dimensional CAs 
defined on a regular lattice, and a genetic algorithm will be applied to searching, 
automatically, for pattern- forming CAs. Techniques to search for CAs using a 
genetic algorithm were exclusively studied for the collective behaviors such as a 
density task[16, 10] , a classification [6], and a period three behavior[9]. However, 
few studies have been done for searching for spatial pattern formations. For 
this purpose, we will consider a kind of interactive particle system (or a kind 
of lattice gas), in which the total number of non-zero cells is conserved. This 
mass-conservative condition is essential to the searching methodology, because 
this eliminates CAs which unnaturally generate and annihilate particles to form 
spatial patterns; such CAs are meaningless for the study of pattern-forming 
mechanisms. 

In the previous work [14], three types of pattern- forming CAs, checkerboard, 
stripe, and sand-like , were found (See Fig. 1). This paper describes a follow- 
up study that statistically analyzes the rules found by the above methodology 
hoping that we can obtain some fundamental mechanism related to pattern 
formations. The analyses in this paper are focused on the checkerboard type 
out of the patterns mentioned above; this pattern is already discussed in [14] in 
depth from the phase transition’s point of view. 

In Sections 2 and 3, the mathematical structure of the CA applied to this 
study and the methodology to search for pattern-forming rules are explained, 
respectively. Section 4 shows the results of the statistical analyses, and some 
related discussions are given. The conclusion is given in Section 5. 
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2 Asynchronous Cellular Automata with Mass 
Conservation 

Cellular automata are dynamical systems in which the current state of each cell 
is determined by the states of its neighboring cells at the previous time step. We 
consider, in this paper, an interactive particle system modeled by a CA having 
two kinds of cells, occupied and empty cells. In common cellular automaton 
schemes, each cell has its own neighborhood and its state is determined regardless 
of what states its neighboring cells will take. In other words, the update rule of 
a cell is a many-to-one mapping, which is why the total mass is not conserved 
in such schemes. One technique employed to realize mass-conservative dynamics 
within CA is to divide the whole cellular space into many blocks composed of 
several cells with the states of each block being updated with the total mass in the 
block conserved. In this case the rule becomes a many-to-many mapping. This 
technique is called partitioning, and such a rule is called a block rule. Margolus 
neighborhood is a typical method of partitioning and has been applied to various 
CAs with the above conservation laws [15]. 

While the partitioning technique has been applied mainly to synchronous 
CAs, for example Fredkin’s Billiard Ball Machine[5], we will, in this study, apply 
the technique to asynchronous updating schemes: The position of the block, 
composed of several adjacent cells to be updated, is randomly selected and the 
states of cells in the selected block are updated while the remaining cells are 
left unchanged. From now on, we will simply call this block ‘neighborhood’. The 
rule applied to neighborhood is constant over time, just as it is for the common 
synchronous scheme. See Fig. 2 for a schematic picture of the asynchronous 
partitioning in the case of von Neumann-type neighborhood being adopted. We 
will concentrate on this type of the neighborhood throughout this paper. 



t=k+1 




Fig. 2. Asynchronous partitioning cellular automata: A set of 5 neighbpring cells, ran- 
domly selected, are updated in the order indicated by the arrows. 



Because the influence of a single update to the dynamics is much smaller 
than that of an update in the common synchronous method, the unit time is 
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newly defined as being composed of 



X max ' Vmax 
A neighbor 



asynchronous updates, where 



N neighbor is the total number of neighbor cells, i.e. five for the von Neumann 
neighborhood; x max {= 100) and y ma x( = 100) are the dimensions of the cellular 
space. With this new definition, a cell is, on average, updated once in a unit 
time step, which is consistent with the common synchronous updating schemes. 
Note that some cells may not be updated in a unit time step, while others may 
be done more than once. 

We adopt, in this study, a commonly used torus- type boundary condition in 
which the top and left edges of the cellular space are connected to the bottom 
and right ones, respectively. The state of each cell is assumed to take an integer 
equal to either nil or unity. The nil state denotes an empty cell and is depicted in 
our graphs as a white cell, while the other state is one occupied by a particle and 
is depicted in our graphs as a black cell; i.e. the dynamical system is regarded 
as a mono-particle artificial chemical system or a kind of lattice-gas system. The 
particles merely move around the cellular space neither separating into two new 
pieces, nor merging with others to form a new particle. To satisfy this mass 
conservation law, the update rule must be defined such that the total number 
of particles in neighborhood does not vary. 

The initial condition of each run is given by placing particles randomly on 
the cellular space with a specified “density”, which is the ratio of the number 
of particles to the total number of cells in the cellular space. We here set the 
density at 0.5. For the random number generations, a unique randomizing seed 
is used for each space tried, ensuring that the initial configurations are always 
different. 



3 A Methodology to Search Automatically for 
Pattern-Forming Cellular Automata 

We, in this paper, measure the degree of pattern formation by the spatial entropy 
of global cellular states. Thus, finding pattern-forming CAs means finding those 
with the spatial entropy being relatively small. The space of CA rules is discrete 
and two closely similar rules do not necessarily produce a similar development. 
The optimization of cellular automaton rules therefore tends to become a messy 
problem; classical tools such as steepest descent method are not applicable. Ge- 
netic algorithms are, however, useful for problems in which the function to be 
minimized (or maximized) has a rugged surface, and we apply this technique to 
our problem as described below. 

Each rule in consideration is composed of 2 Nnei3hbor = 32 mappings. The 
graphic patterns of all the 32 entries of the mapping are shown in Fig. 3. Be- 
cause such graphic patterns can be indexed by integers, the rules of our interest 
can be represented by the array of 32 integers, say T[i]{i = 0, 1, ... , 31). This 
integer array becomes the chromosome of this genetic algorithm. The value of 
T[i], i.e. the output of v/tli entry assumes an integer between 0 and 31, but as 
described above each transition must satisfy the mass-conservative condition, 
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Fig. 3. All the entries of the transitions of the cellular automata. 



and the transitions changing the mass inside the neighborhood are prohibited; 
then the whole rule space is approximately composed of ~ 10 14 rules, which is 
much smaller than that of the general rule space of 32 32 rules. 

The algorithm of the searching is: 

1. Choose M chromosomes, i.e. mass-conservative cellular automaton rules at 
random, where M is called a population. 

2. Run all of these CAs. 

3. If at least one of the CAs satisfies a condition of being optimized (see below 
for the detail of the condition), stop the program; continue otherwise. 

4. Calculate the fitness functions (given below) of all the CAs. 

5. Choose the M e ( 0 < M e < M) best-fit CAs for the next generation; they are 
called elites. 

6. Choose M n pairs of individuals at random out of the remaining CAs for 
the next generation so that the total of the elite and non-elite individuals 
becomes M again in the next generation, i.e. M = M e + 2 M n . 

7. Cross over each pair of non-elites at a randomly chosen point with probabil- 
ities of P c , crossover rate. 

8. Mutate each non-elites with probabilities of P m per a transition. Note that 
new transitions also satisfy the mass-conservative condition. 

9. Go to Step 2. 
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Table 1. The parameters of the genetic algorithms 



symbol 


explanation 


value 


M 


population 


20 


M e 


the total number of the elite 


6 


Pc 


crossover rate 


0.5 


Pm 


mutation rate 


0.05 


H° 


the target value of H s 


0.57 



The fitness function is defined as: 



F = ( 



1 



) • t c , 



(1) 



where H s (t ) is a spatial entropy of global cellular states at the time step t (A 
rigorous definition of H s is given in Appendix A.); H° is the target value of 
optimization of H s ; t c is the transition time, that is, the time when the time 
development of H s becomes almost steady. The length of t c is determined by 
the sequential tests of 



\H s (t+l)-H s (t)\<e, 
\H s (t + 2) — H s (t +1)| < e, 

\H s (t + 3)-H s (t + 2)\ < e. 



(2) 

( 3 ) 

( 4 ) 



The smallest t, satisfying all of eqs.(2)-(4) was considered as t c , where e was 
empirically set at 0.003. In case a pattern formation occurs, the transition time 
(i.e. t c ) is long, For this reason, the multiplication by t c in eq.(l) was added 



to a simple 



1 

H s (t) — H° ’ 



and it was empirically found that this multiplication 



improved the optimization speed. The condition to stop the program is when at 
least one of the individual was found H s (t c ) < H°. The parameters for running 
the genetic algorithm are summarized in Table 1. 



4 Statistical Analyses of the Rules Found by the Genetic 
Algorithm 

Many runs of the optimization trial by the genetic algorithm were conducted, 
and the rules that satisfied the condition were collected. The number of gener- 
ations needed to reach the condition of being optimized is sometimes less than 
a hundred, but a few other cases reached 300 generations at which the program 
gave up the further optimization and went to a next trial. The spatial patterns 
that the rules found by the searching give are the same as those in [14], i.e. , 
(a) checkerboard, (b)stripe, and (c)sand-like pattern (see Fig.l). In the following, 
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Table 2. The Shannon entropy of the selected transitions 



entry 


entropy 


entry 


entropy 


entry 


entropy 


entry 


entropy 


0 


0.0 


8 


0.72 


16 


0.11 


24 


0.67 


1 


0.71 


9 


0.99 


17 


0.66 


25 


0.98 


2 


0.70 


10 


1.00 


18 


0.66 


26 


0.99 


3 


0.98 


11 


0.65 


19 


0.98 


27 


0.73 


4 


0.69 


12 


0.99 


20 


0.68 


28 


0.98 


5 


0.99 


13 


0.66 


21 


0.99 


29 


0.71 


6 


0.98 


14 


0.69 


22 


0.98 


30 


0.64 


7 


0.67 


15 


0.10 


23 


0.71 


31 


0.0 



we will concentrate on the analyses only of the rules that develop the checker- 
board pattern. 

The total number of the collected checkerboard rules is ~ 1000. For these 
rules, the probability of what transition is selected through the optimization 
process, say Pij, were calculated, where i and j are indices for the entry and the 
output of the transition, respectively. Using pij, the Shannon entropy of pij for 
each entry i was calculated; the results are shown in Table 2. 

From the entropy values, it is possible to classify each entry into 4 groups: 

I Entry-0 and Entry-31 (The entropy is exactly nil; each entry has only one 
possible transition because of the mass-conservative restriction.), 

II Entry-15 and Entry-16 (The entropy is ~ 0.1; the entry pattern already 
matches the checkerboard pattern and the identical mapping is dominantly 
selected.), 

III Entry-3, Entry-5, Entry-6, Entry-9, Entry- 10, Entry-12, Entry-19, Entry-21, 
Entry-22, Entry-25, Entry-26, and Entry-28 (The entropy is almost unity, 
thus the transition is almost randomly selected. From Fig. 3, we can rec- 
ognize that these entries are not directly related to the creation of the 
checkerboard pattern.), and 

IV The remaining entries (The entropy is in the range of 0.64 — 0.73; The 
further analysis will be given below.). 

The key point of the above grouping is whether or not the entry is one of the 
matching patterns of which the checkerboard pattern is composed, that is, 15tlr 
and 16th graphic patterns in Fig. 3. 

The most-frequently selected transition for each entry and its probability, 
sup pi j , were also calculated (See Table 3). From Table 3, we notice that Group 

i 

IV cau be classified into two subgroups: 

IVa Entry-1, Entry-2, Entry-4, Entry-8, Entry-23, Entry-27, Entry-29, and 
Entry-30 (The value of sup Pi t3 is around 0.6 — 0.7; The patterns for these 

i 

entries are possible to transit to either of the two matching patterns of 
which the checkerboard pattern is composed, and such transitions are most- 
frequently selected.), and 
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Table 3. Transition most-frequently selected by the optimization; the values in the 
parenthesis is its probability (i.e. . supp;j.) 
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Fig. 4. The time development of the cellular automaton given by the rule in Table 3. 



IVb Entry-7, Entry-11, Entry-13, Entry-14, Entry-18, Entry-20, and Entry-24 
(The value of sup p h j is ~ 0.3; The patterns of these entries are close to one 

i 

of the matching patterns, but the transition to them are prohibited under 
the mass-conservative condition; the identical mappings in these entries are 
most-frequently selected.). 

Through the above classification, we seem to understand how each transition 
is selected to achieve the checkerboard pattern. Natural question is then; “Is this 
all about the tricks of the pattern formation?”. If so, a rule created by putting 
all the most-frequently selected transitions of Table 3 together must have a 
performance of making the checkerboard pattern as good as that given by the 
“raw” rules obtained by the optimization. Figure 4 gives the time development of 
the CA composed of the most-frequently selected transitions. The graph at t = 25 
has the spatial entropy of ~ 0.72, much larger than 0.57, the optimization target. 
The configuration of the cellular states at t = 25 (Fig. 4b)is almost identical to 
that at t = 50 (Fig. 4c), and the spatial entropy of the cellular states are not 
improved even in the further time-development. Such a poor performance does 
not meaningfully vary with the initial condition of the cellular states, so this 
rule’s performance cannot compare with that of the rule composed of a raw set 
of transitions found by the searching. See Fig. 1(a); this graph is a cellular states 
developed by a typical rule found by the searching. 
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Although the statistical analyses given above seem theoretically sound, these 
are not enough to elucidate the whole mechanism of the pattern formation. In 
other words, the genetic algorithm selects each transition more cleverly than 
we can intuitively do. This result is not, however, surprising because complex 
phenomena such as pattern formations cannot be decomposed into a simple 
addition of many simple actions, but are organized by combinations or sometimes 
networks of them. 

5 Conclusion 

As a study of self-organization, the spatial pattern formation of CAs was stud- 
ied. Especially, to seek the pattern formation more consistent to Nature, asyn- 
chronous CAs under the mass conservation law was explored and the asyn- 
chronous partitioning scheme was applied for this purpose. A methodology for 
automatically searching for the pattern-forming CAs was introduced, and its 
performance was confirmed. The methodology can be extended to more compli- 
cated situations for more realistic simulations of natural phenomena. 

In addition, the rules giving the checkerboard pattern were collected by using 
the searching method for the purpose of their statistical analyses. The proba- 
bility how often a certain transition is selected by the searching was discussed. 
It was possible to explain why certain transitions are dominantly selected by 
the searching, but the success of such analysis was limited because it failed to 
explain all the mechanisms of the pattern formation. This indicated that the 
checkerboard-pattern formation cannot be explained by a deduction of a simple 
addition of the mappings. 

A further development of this study is to analyze the interaction between 
different transitions. For this purpose we perhaps need much more samples of 
the rules, and therefore it is necessary to drastically improve the searching speed. 
I hope that this extension could open a wide range of potential studies for ap- 
proaching the origin of self-organization in Nature. 
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A Definition of Spatial Entropy of the Global 
Configuration of CAs 

Consider four adjacent sites of the cellular automaton space such as ( i + 

1 ,j), ( i,j + 1) and (i + 1, j + 1), where i and j are the x-axis and y-axis position 
of an arbitrary cell, respectively. There are 2 4 = 16 possible patterns for this 
local patch if each cell takes either ‘0’ or ‘1’ state. In this paper, we use 

H s (t) = Y1 P s(t)Io9w(PHt)), (5) 

k 

as the definition of the spatial entropy of global configuration at an arbitrary 
time step r, where P s fe (r) is the probability for a particular pattern of the local 
patch at the time step r. Note that a base of 16 is used for the logarithmic 
function as the entropy assumes a real number between nil and unity. 
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Abstract. This paper reviews the history of embedded, evolvable self- 
replicating structures implemented as cellular automata systems. We 
relate recent advances in this field to the concept of the evolutionary 
growth of complexity, a term introduced by McMullin to describe the 
central idea contained in von Neumann’s self-reproducing automata the- 
ory. We show that conditions for such growth are in principle satisfied by 
universal constructors, yet that in practice much simpler replicators may 
satisfy scaled-down — yet equally relevant — versions thereof. Examples 
of such evolvable self-replicators are described and discussed, and future 
challenges identified. 

1 Introduction 

In recent decades, the study of embedded self-replicating structures in cellular 
automata has developed into one of the main themes of research in Artificial 
Life [3 1,32]. A common goal motivates such research: to extract the fundamen- 
tal organizing principles that govern real-world biological phenomena (and, in a 
wider sense, life as we know it [13]) from an abstraction of their driving biophys- 
ical processes. Central among such processes is self-replication in its variety of 
forms. The eminent mathematician John von Neumann was the first to suggest 
that essential features of such biological self-replication, with its many degrees of 
freedom and complex kinematics, could be usefully represented in a discrete cel- 
lular space with uniform local rules[41]. His seminal self-reproducing automata 
theory, which he situated “in the intermediate area between logic, communica- 
tion theory and physiology” [41, p.204], set the stage for future research in ar- 
tificial self-replication [16] and remains among the defining achievements of this 
field[17]. 

Following von Neumann’s work of the late 1940s and early 1950s, research on 
CA-based self-replicators split into a number of major trends. Among these, the 
majority are efforts to implement regulated behavior (universal construction [8, 
9,20,38,40], self-replication[5,12,14,18,22,34], self-inspection[ll], functionality [7, 
19,37]) manually introduced to satisfy pre-defined goals of the designer. Such 
goal-oriented design is important for resolving theoretical problems (bounds and 
limitations on self-replicating structures) and for direct application (computa- 
tion, problem-solving, nanotechnology), yet does little to address the fundamen- 
tal issue that shaped von Neumann’s original theory. This issue centers on the 
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vague and intuitive concept of “complication” [41, p.78], roughly measured in 
terms of the number of elementary parts of a machine or organism. Von Neu- 
mann observed that natural organisms have a surprising collective tendency to 
increase such complication over time. Such increases are unexpected , resulting 
from robustness and adaptability of the natural self-replication process rather 
than from any specific functionality. Von Neumann proved that, in principle, 
such increases are not confined to natural automata but may equally be achiev- 
able in artificial ones, in particular by one of his own creation: a complicated 
universal construction machine embedded in a 29-state CA system. His machine 
realizes an ingenious separation between static description (tape) and active 
translation (machine) that has since become synonymous with the concept of 
self-reproduction[15]. Yet while significant as a defining example, von Neumann’s 
design has been widely critiqued for its reliance on a lengthy set of rules and ex- 
pansive space. His machine and others like it are so computationally demanding 
as to be largely unfeasible 1 ; hence simpler alternatives with comparable potential 
are sought. 

Considerable disagreement on the topic of such simpler replicating structures 
has arisen over the fifty years since von Neumann first developed his theory. Such 
disagreement stems from conflicting definitions of “self-reproduction” and from 
misconceptions regarding von Neumann’s original intent. Most commonly, the 
“self-reproduction problem” [3, p.xv] has been stated in terms of “non-trivial” 
self-reproduction, understood to mean reproduction of structures with some 
minimal level of complexity. First mentioned by Burks[4, p.49], this idea was 
later adopted by Langton[12] and subsequently incorporated as a guiding princi- 
ple in CA-based self-replication models [31]. Langton’s famous Self- Reproducing 
(SR) Loop, a model often mistakenly viewed as a simplification of von Neu- 
mann’s machine, embodies a common belief derived from this way of thinking: 
that minimal non-trivial self-reproduction is explicit (or self-directed[ 22]) self- 
reproduction. Such an interpretation is natural when one focuses on engineering 
aspects of von Neumann’s machine design, which indeed embody a highly explicit 
translation/transcription copy process. Yet it fundamentally overlooks an essen- 
tial property stressed by von Neumann himself, namely that “self-reproduction 
includes the ability to undergo inheritable mutations as well as the ability to 
make another organism like the original” [42, p.489]. Such capacity to withstand 
viable heritable mutations was central to von Neumann’s formal theory and to 
the evolutionary growth of complexity described therein [17, 36]. 

In this paper, we take a new look at the history of self-replication stud- 
ies in cellular automata. With hereditary variation in mind, Sections 2 and 3 
review key concepts underlying von Neumann’s original work, highlighting an 
area of the field that has received little attention. These are the recently de- 

1 Von Neumann’s machine required 29 states and a highly complex set of transition 
rules, occupying an estimated 50,000 to 200,000 CA cells[31, p.241]. Pesavento[20] 
recently simulated a modified version of this CA, yet was unable (due to prohibitive 
computational demands) to implement even a single complete self-reproduction cy- 
cle. 
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veloped evolvable loop- and worm-shaped self-replicators of marginal hereditary 
and structural complexity [1,6, 29, 30, 35], reviewed in Section 4, that straddle the 
boundary between von Neumann’s powerful yet fragile machine and more robust 
yet trivial systems. In Section 5, we identify future challenges and relate their 
importance in the field of Artificial Life to von Neumann’s original search for 
the evolutionary growth of complexity. 

2 Embeddedness, Explicitness, and Heredity 

A defining property of CA systems of self-replicators, embeddedness quantifies 
the extent to which state information of an individual is expressed in the arena 
of competition 2 . Taylor[36, p.216] argues that embeddedness is important be- 
cause it enables “the very structure of the individual to be modified” [original 
emphasis] , likely a necessary condition for open-ended evolution. Related to em- 
beddedness is the range of possible interactions that are allowed between objects 
or structures. Von Neumann’s machine is fully embedded; the entire specifica- 
tion of the machine — its various control organs, memory, and construction arm 

are expressed in the cellular space itself rather than hidden in auxiliary non- 
interactive locations. At any time, the construction process may be modified or 
disrupted via local rules applied to cells adjacent to the machine. This is dis- 
tinct from e.g. systems of evolutionary computer programs such as Tierra[21], 
popular and well-studied in Artificial Life, which assume an unlimited number 
of CPUs existing outside of the arena of evolution, beyond the influence of other 
individuals. Since CA by their nature do not “hide” any information (with the 
exception of the transition rules themselves), CA-based replicators are as a rule 
highly embedded and in general allow for direct and unrestricted interactions 
among them. This goes some way to explaining their popularity as a medium 
for the theoretical study of self-replication. 

Self-replicators embedded in CA share an important defining property with 
biological organisms: both are fundamentally built up from — and interact 
through — a common material structure grounded in physical laws (i.e. CA 
rules). Unlike other evolutionary systems such as those of computer programs, 
CA-based models supply no universal structural cues: everything down to the 
separation between replicator and its environment is distinguished relative to 
the system observer. This property has the side-effect of making such systems 
intrinsically “messy” to analyze, yet has the potential to elevate the richness 
and surprise of their evolutionary dynamics to levels inherently excluded from 
less embedded systems. Von Neumann never witnessed such dynamics; although 
his system is in principle capable of producing them, it is in practice far too 
computationally demanding and structurally fragile to be useful in this regard. 
This key difference manifests the importance of developing simpler, more robust 
models of self-replication imbued with the potential for evolvability. 

Since von Neumann’s original work, the majority of discussions on such CA- 
based embedded models of self-replication have focused on the explicitness of 

2 This discussion of embeddedness is based on an analysis by Taylor[36, p.215-219]. 
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the self-reproduction process rather than any potential evolvability contained 
therein[12,18,22,32]. Langton, who supplied possibly the most well-known argu- 
ment to this effect, makes the case that 

Von Neumann’s work suggests an appropriate criterion, which is all the 
more appropriate because it is satisfied by molecular self-reproduction: 
the configuration must treat its stored information in two different man- 
ners [...]: interpreted, as instructions to be executed (translation), and 
uninterpreted, as data to be copied (transcription). [12, p.137, original 
emphasis] 

Although von Neumann indeed based the design of his construction machine 
on this translation/transcription process, and although he did emphasize the 
interpreted/uninterpreted distinction at the heart of his theory, this does not 
imply that such separations should be considered an appropriate criterion for 
self-reproduction. A criterion based on explicitness alone is, in any case, some- 
what arbitrary as the transition rules of the CA will always play some role in 
enabling structures to copy themselves. 

More appropriate as criterion is the issue of heredity, on the basis of which 
simple replicators such as Langton’s Loop can be unambiguously distinguished 
from potentially evolvable machines such as that of von Neumann. Whereas 
Langton takes the translation/transcription process to be of primary interest, 
von Neumann emphasized static descriptions, the importance of which 

is that they replace the varying and reactive originals by quiescent and 
(temporarily) unchanging semantic equivalents and thus permit copy- 
ing),] the decisive step which renders self-reproduction (or, more gener- 
ally, reproduction without degeneration in size or level of organization) 
possible. [41, p.122-123, emphasis added] 

Such non-degenerative reproduction is an immediate consequence of construc- 
tion universality, an important result recently clarified and strengthened by 
McMullin[17]. Yet whereas such universality is computationally demanding and 
biologically implausible, a less stringent yet explicit encoding — one that en- 
ables a shift in heredity from limited towards indefinite — is in fact possible and 
indeed has already been implemented in a number of models[l,10,26,30,35]. Such 
a shift enables evolutionary complexity growth, described in the next section. 

3 The Evolutionary Growth of Complexity 

In his recent re-appraisal of von Neumann’s work, McMullin[17] summarizes 
three principal conditions he deems necessary to solve the problem of what he 
calls the evolutionary growth of complexity. Namely, to do so 

one would need to exhibit a concrete class of machines [...] in sufficient de- 
tail to satisfy ourselves that they are purely mechanistic; one would need 
to show that they span a significant range of complexity; and finally, one 
would have to demonstrate that there are construction pathways leading 
from the simplest to the most complex[17, p.351, original emphasis] 
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He goes on to show that von Neumann’s machine satisfies these require- 
ments, that it is thus capable of “incremental, bootstrapping increases in 
complexity” [17, p.354, original emphasis], and moreover that it is arguably the 
simplest machine so far capable of doing so. The proof of this result rests 
on the construction universality that von Neumann worked so hard to real- 
ize in his machine. Whereas the concept of complexity is notoriously ill-defined 
(von Neumann referred to his own understanding of it as “vague, unscientific 
and imperfect” [41, p.78[), this turns out not to be essential for demonstrating 
complexity-increasing construction pathways — the central piece of the puzzle. 

Having elucidated the above result, McMullin goes on to ask the important 
question: “what further conditions are required to enable an actual , as opposed 
to merely potential , growth in complexity?” [17, p.360, emphasis added]. Von 
Neumann’s machine does not qualify as “actual” primarily due to the fact that 
its structure is extremely fragile, unable to withstand minimal external pertur- 
bations. Such fragility would render any interactions — notably those between 
parent and child immediately following self-reproduction — highly destructive, 
preventing the coexistence of a population of such machines. In addition, neither 
current computational resources nor those of the foreseeable future hold much 
promise for implementing such a complicated system on any significant scale. 

There is an added concern, and this relates to the concept of a Darwinian — 
as distinct from mutational — growth of complexity. The former entails actual 
growth and requires direction from selection pressures, whereas the latter merely 
implies possible growth and only requires the presence of construction pathways. 
McMullin notes that, if the former were to occur at all, they would happen “along 
paths in the genetic network that lead “uphill” in terms of fitness” . Yet a fixed 
genetic network such as that of von Neumann’s machines “ may impose severe 
limits on the practical paths of Darwinian evolution (and thus on the practical 
evolutionary growth of complexity)” [17, p.358, original emphasis]. The question 
of whether or not a population of replicating machines, directly competing and 
evolving within such a network, would indeed favour increases in complexity is 
clearly an important one, yet one to which a complicated system such as von 
Neumann’s provides no concrete answers. 

One may thus ask: what then are the practical alternatives to fragile and 
complicated universal constructors, and are they capable of any evolutionary 
growth in complexity? Addressing the latter question, McMullin makes the im- 
portant admission 3 that the solution he presents “may seem to imply that a 
general constructive automaton (i.e., capable of constructing a very wide range 
of target machines) is a prerequisite to any evolutionary growth of complexity. It 
is not.” [17, p.357, original emphasis] If we are interested in studying complexity- 
increase during the earliest stages of life, a universal constructive automaton in 
any case hardly seems like the appropriate model; such a complicated machine 
would have to be preceded by simpler, less sophisticated replicators. It is to such 
simpler evolvable replicators that we turn our attention in the following section. 

3 McMullin makes this point in the context of a related discussion on genetic rela- 
tivism/absolutism. 




166 C. Salzberg and H. Sayama 



4 Evolvable Self-Replicators in Cellular Automata 

A wide array of self-replicating structures have been implemented in cellular 
automata since von Neumann first introduced his self-reproducing automata 
theory. Such models are often conceptualized as situated on a linear scale of 
“complexity”, with complicated universal constructors at one end and simple 
self-replicators at the other. Because the terms “simple” and “complex” are 
themselves so ill-defined, it is not always clear what properties should be asso- 
ciated with the intermediate region between these extremes. Sipper[31, p.245], 
for instance, situates self-replicators with added computational and functional 
capabilities^, 19, 37] in this region. Models such as these are indeed interesting 
from the point of view of achieving targeted behaviours, but in terms of studying 
the evolutionary process itself — and in particular that of complexity-increase 
— they fall short of the requirements stated earlier 4 . In particular, such models 
lack the “surprise!” [23] element that plays a crucial role in the emergence of 
complexity-increasing evolution. 

Taylor[36, p.200] presents an alternative 2D visualization in which each sys- 
tem is assigned to a point on a plane, with x and y axes representing the copying 
process (explicit/implicit) and heredity (limited/indefinite), respectively, of self- 
replicators. The diagram we have drawn in Fig. 1 is based on Taylor’s and, like 
his, is not intended to be quantitatively accurate; rather it is a qualitative tool 
to better visualize certain key concepts. Von Neumann’s machine, which carries 
out a highly explicit translation/transcription process, is also construction uni- 
versal and hence appears close to the upper left (indefinite/explicit) portion of 
the diagram. In contrast, self-reproduction of template-based replicators[10,34] 
is largely implicit in “physical” rules, allowing any arbitrary string to be repli- 
cated; hence they appear close to the upper right (indefinite/implicit) portion 
of the diagram. Langton’s Loop and other minimal self-replicators appear at 
the same level on the heredity scale as do trivial self-replicators; neither accom- 
modates any hereditary variation, although Langton’s CA has a more explicit 
translation process. 

Note that certain evolving self-replicating loops and worms — not yet devel- 
oped at the time when Taylor produced his thesis — are included at the center 
of the diagram, far from Langton’s Loop along the hereditary axis. It is this cen- 
tral region, representing self-replicators of marginal hereditary and structural 
complexity, that is our focus. Such structures should have the following virtues: 
they should have some potential for hereditary variability, should be sufficiently 
robust to self-replicate continuously without self-destruction, and should be suf- 
ficiently simple in specification to be realized on a computer. 

Chou and Reggia[6] designed the first such system while studying the emer- 
gence of CA-basecl self-replicating loops from a “soup” of components. Initiated 
with a random distribution of “unbound” cells, the soup is “stirred” (using 

4 This represents another interpretation of “complexity” , namely that of computational 
functionality (or universality). McMullin[17, p.348-349] addresses this perspective 
and finds it inadequate for describing evolutionary complexity-growth. 
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Fig. 2. Pattern formation of some different size-6 evoloop species after 5000 time steps. 



only local rules) until a minimal-sized loop appears by chance. This structure 
is identified by local rules, certain “bound” bits are set to distinguish it, and a 
self-replicating loop emerges. An evolutionary cycle then commences and con- 
tinues indefinitely: smaller loops, induced by “growth bits” scattered about the 
space, become progressively larger until the space cannot accommodate them, at 
which point smaller ones re-emerge. Chou and Reggia did not aim at studying an 
evolutionary process with this work but rather the emergence of self-replicators. 
Evolution in this model, explicitly induced and somewhat artificial, was not a 
priority. Heredity was also highly constricted, with each loop “species” defined 
exclusively by replicator size. 

Sayama’s “evoloop” model[29], developed shortly thereafter and based on 
his earlier self-replicating loop called the SDSR Loop[28], differs significantly in 
these regards. Although it borrows its basic structure from Langton’s Loop, the 
evoloop exhibits more complex and interesting evolutionary behaviour than do 
earlier minimal self-replicators of its kind behaviour traditionally assumed to 
require a more complicated system specification. Within this model occur evo- 
lutionary processes (viewed here as variation plus natural selection) that result 
exclusively from the local , structural interaction of embedded self-replicators. 
Such variation and selection processes are not defined a priori but rather emerge 
spontaneously from the low-level “physics” of the CA, a distinctly bottom-up 
feature that distinguishes it from earlier evolutionary models. Initially it was 
believed that such variation was limited to size only, but extensive and detailed 
studies[24] have since revealed that the genetic state-space grows combinatorially 
with loop size. Each genotype within this vast possibility space has unique growth 
patterns (Fig. 2) and genealogical connectivity [27], exerting a demonstrable and 
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decisive pressure on selection and resulting in a rugged fitness landscape [26], 
producing a limited form of complexity- increase in hostile environments [25]. 

Evolutionary systems of self-replicating structures with other shapes have 
also yielded interesting evolutionary potential. Morita and Imai[18] were the 
first to develop CA-based shape-encoding worms and loops. A variety of differ- 
ent structural patterns can replicate successfully in this model, however colli- 
sions leading to mutations are explicitly disallowed. Sayama[30] borrowed this 
basic shape-encoding worm structure and adapted it to enable gene transmis- 
sion through structural interaction, enabling worms to change their genotype 
(and phenotype) in a bottom-up way. Results from experiments with this model 
showed a tendency of worms to increase their average length over time, hence 
demonstrating a limited form of complexity-increase. 

Suzuki and Ikegami[35] applied the shape-encoding mechanism to study 
interaction based evolution of loop structures. Although more complicated 
than the models already mentioned, this CA allows for a variety of different 
loop-shaped structures (other than the traditional square) to successfully self- 
replicate, mutate, and evolve, and in this respect it is quite unique. The main 
thrust of this work is an interaction mechanism between replicating structures 
that makes use of a hypercycle-like network of collision rules. Using such a net- 
work, it was found that larger, more complex loop structures emerged at the 
boundary between waves of superior and inferior species in a propagating lry- 
percycle formation. 

Using a novel CA system incorporating both self-inspection and genetic re- 
production strategies, Azpeitia and Ibahez[l] investigated the spontaneous emer- 
gence of robust cellular replicators (self-replicating loops) . Emergence in this sys- 
tem is notably achieved without explicit mechanisms (e.g. “bound” or “growth” 
bits as in [6]). The detailed comparison of self- inspection and genetic reproduc- 
tion strategies in this study is moreover unique among those we have discussed. 
In their conclusions, the authors note: “experiments suggest that self-inspection 
reproduction could be a precursory stage for the genetic one.” Yet “despite hav- 
ing a better chance to trigger the reproduction dynamics, self-inspection would 
be taken over by the evolutinary advantages of genetic reproduction.” [1, p.142] 

Systems of template-based replicators[10,33,39] aim at simulating the emer- 
gence of living (i.e. self-replicating and possibly evolving) structures from a soup 
of non-living components under virtual physical laws; they hence bear similarities 
to the models already mentioned. Hutton[10] recently designed such a system in 
a CA-equivalent Artificial Chemistry (Squirm3) in which strands of “atoms” self- 
replicate via a set of reaction rules that apply locally throughout the space. No 
limitations are imposed on this replication process, hence any arbitrary strand, 
if immersed in a sufficiently large random soup of atoms, will self-replicate. Al- 
though the number of different possible “types” is thus quite high in this system, 
all strands have the same basic behaviour. Hutton discovered the emergence of 
self-replicators and a limited form of adaptation to the environment, although 
he concluded that more reactions were necessary to enable complexity growth. 




Heredity, Complexity, and Surprise 169 



5 Conclusion 

The systems we mentioned in the previous section constitute a small sample of 
the possible marginal self-replicators that could potentially be devised in CA. 
Results with these models, notably the genetic diversity and complex geneal- 
ogy recently discovered with the evoloop[24,26], demonstrate that complexity- 
increase of a limited kind is possible in practice using such evolvable explicit 
self-replicators. In many ways such models constitute the first step towards von 
Neumann’s original goal of complexity-increase in CA, steps that he himself 
could never take because his model was too fragile and complicated. 

Models of the kind discussed here are also important as a teaching tool. For 
a student who is learning about the principles of evolution, a system such as 
the evoloop has unique advantages over more complicated systems such as those 
of evolutionary computer programs. Namely, the basic mechanism for evolution 
(variation and selection) is in one case emergent , in the other case prescribed. 
Whereas the latter case assumes certain basic principals of biology, the former 
allows one to discover them. The analysis and insight that ultimately lead to 
such a discovery are at the heart of scientific thinking. 

An immediate future challenge is to incorporate within CA systems of self- 
replicators the possibility for functional interaction potentially leading to more 
complex hierarchical relations between organisms. The diversity of multi-cellular 
life that has evolved in the biosphere is clear evidence that self-replication at 
concurrent multiple scales plays a critical role in the growth of complexity. Al- 
though the emergence of self-replication has been shown[l,6,10], self-replication 
of macro-scale replicators from simpler micro-scale ones has not 5 . The study 
of self-replication and evolution — and in particular the search for complexity- 
increasing evolution — would likely benefit from work in this direction. 
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Abstract. This paper presents results of a study of a genetic algorithm, designed 
to evolve cellular automata for solving a given problem. Evolution is performed 
between the cell’s update-rules in a local manner, allowing for easy parallelization. 
As a case study, the algorithm was applied to the density classification problem: 
classifying any given initial configuration according to the percentage of 1-valued 
cells. The main result presented in this paper is an ’unlearning’ phenomenon: 
highly fit solutions are generated by the algorithm, only to be ’unlearned" and 
completely disappear as the evolutionary run continues. 



1 Introduction 

Cellular Automata were invented in the late 1940’s by von Neumann[l] as simplified 
models to study self-reproduction. Since then, CA have been widely used to study 
complex behavior. In such studies, the update rules which dictate the CA behavior are 
either designed per se or mimic the local behavior of problems at hand. The results 
and analysis are then based upon global behavior which is apparent (or not) in the 
computation over time. A well known example of such problem is Conway’s Game 
of Life, which displays complex behavior (and universal computation [11]) using very 
simple local rules. Other examples such as fluid flow simulation, growth modeling and 
Ising Spin models can be found in [4], [4] also includes theoretic analysis of CA and 
various implementation details and considerations. In all of the above examples, and 
many other, the CA rules are typically available or are inspired by nature. 

On the other side of CA research is the reverse problem: How to decide what rules 
should be used in order to perform a specific computation? Answering this question is a 
complicated and crucial issue, both for utilizing and implementing CA as real life tools 
and for explaining natural phenomena as the result of local interactions. 

One approach to this problem involves using Genetic Algorithms in an attempt 
to ’evolve’ a CA for solving a specific task (for example, [2,3,5]). A more extensive 
description of the field can be found in [8]. In 1996, Moshe Sipper published a Genetic 
Algorithm designed to co-evolve non uniform CA for performing computational tasks 
[6,7]. In his papers [6,7], Sipper shows that using that algorithm. Cellular Automata were 
evolved to performed several complex tasks. However, a study of the algorithm and its 
parameters is not presented. This paper presents some interesting insights based upon 
such a study. 



P.M.A. Sloot, B. Chopard, and A.G. Hoekstra (Eds.): ACRI 2004, LNCS 3305, pp. 172-181, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 




Unlearning Phenomena in Co-evolution of Non-uniform Cellular Automata 



173 



Algorithm 1 Pseudo-code of the cellular programming algorithm 

Parameters: 

— M - How many time steps the evolved CA would be allowed to perform when computing the solution. 

— C - How many initial configurations are used for rule evaluation. 

— Stopping criteria 

1 . Initialization: 

a) For each cell i in the CA do (possibly in Parallel) 

1. Initialize rule table of cell i. 
ii. fi= 0 { fitness value } 

b) c = 0 { initial configuration counter } 

2. while not done do: 

a) generate a random initial configuration. 

b) run CA on initial configuration for M iterations. 

c) For each cell i do (possibly in Parallel): 

1. if cell i is in the correct final state then fi = fi + 1 

d) c = c + 1 

e) if c mod C = 0 then { evolve every C configurations } 

1. For each cell i do (possibly in Parallel): 

A. Compute nfi (c) { number of fitter neighbors } 

B . if nfi (c)=0 then don’t change the rule of cell i 

C. else if nfi (c)= 1 then replace the rule of cell i with the fitter neighboring rule, followed by mutation. 

D. else if nfi (c)= 2 then replace the rule of cell i with the crossover of the two fitter neighboring rules, 
followed by mutation. 

E. else if nfi (c)> 2 then replace the rule of cell i with the crossover of the two randomly chosen fitter 
neighboring rules, followed by mutation. 

F. fi = 0 



2 Algorithm and Methods 

This section presents the algorithm designed by Sipper in [6,7,8] for co-evolving a CA 
for solving a given problem. We will use the same definition of Cellular Automata as in 
[7], where uniform CA implies that the cells’ state is updated using a single update rule. 
This is in contrast to non-uniform CA where each cell has its own local update rule. A 
more formal notion of Genetic Algorithms and the related operators (Crossover, mutation 
etc.) can also be found in that paper. The algorithm studied in this paper is presented in 
Algorithm 1 . At every iteration, the quality of the current CA rules is evaluated over a set 
of randomly selected initial configurations followed by creation of new rules using the 
better ones of the previous step. The algorithm is supplied with a method for the correct 
measurement of the cells’ final behavior/value. This can be in the form of an “oracle” 
able to evaluate the correct result for a given example or by some sort of a criteria to 
measure the output correctness (for example: correct behavior over time). After a given 
number of such examples have been presented to the CA and its rules are evaluated, an 
evolution step is performed. Each cell is compared with its neighboring cells. If the cell 
is the best cell in its neighborhood, it is left unchanged (this will be further referred to 
as elitism). If not, its better neighbors are used to create the cell’s new rule. 

The algorithm is different from the standard GA approaches to CA programing 
taken, for example, by Mitchel et al., in [5], In MitcheTs algorithm, the population 
consists of uniform CA and their evaluation is done independently. Therefore, there is 
no dependence between population elements of the same generation. However, in the 
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algorithm suggested by Sipper, the GA population are the cell rules and therefore their 
evaluation is highly interdependent. Due to the way evaluation is done, the cells are 
highly influenced by their neighbors. A ’bad’ neighbor can cause the current cell to be 
evaluated badly as well. Furthermore, evolution is done in a local manner, since each 
cell is evolved using only the neighboring rules. This leads to a very local but dependent 
evolution. Cells’ success affects the success of their neighbors and vice versa. In contrast, 
in Mitchel’s Algorithm, every CA (rule) in the population can be crossed over with any 
other in the population and not only with a very limited neighborhood. On one hand the 
local evolution used might hinder the algorithm, since it may prevent a good rule from 
being fully exploited. On other hand, such interrelations lead to fitter groups of rules: 
rules which are fit for working together. It is this interaction between rules and their 
effect on each other’s evolution process that gives rise to the name co-evolution. This 
algorithm is similar to the Diffusion Genetic Algorithms [9], where each element is only 
allowed to ’mate’ with its close neighbors. However, in that model, each element of the 
population is a ’solution’ while here, it is the population itself which serves as one. 

Note that the Algorithm itself can be viewed as an extended form of (stochastic) 
Cellular Automata, acting as a non-uniform CA for a fixed number of iterations, followed 
by a local evolution step. The algorithm can also be described in terms of an Abstract 
Cellular Genetic Algorithm, a framework for a parallel population based process [ 12, 
13]. This means that most of it can be performed, at least in principal, in parallel. 

2.1 Implementation 

This section describes the specific implementation used to study the algorithm. The 
description is divided into two parts, corresponding to the main ingredients of the algo- 
rithm: The non-uniform CA being evolved, and evolution related operations. 

Non-Uniform CA: The CA studied here are all two dimensional grids of size N x N 
(which will be referred to as size N in the rest of the paper). Following the reasoning 
presented by Sipper, the CA performs a number of iterations equal to the number of 
cells, N 2 . All cells have a value of 1 or 0 and have a von Neumann neighborhood with 
periodic boundaries. Each cell’s rule was stored as 2 5 bits denoting the the cell’s value 
in the next iteration for all possible setting of its neighborhood. In order to map the 
neighborhood configuration to a bit, the neighborhood cell values were converted into a 
binary number in the following order of importance: Cell value, Left, Right, Down and 
Up. 

Evolution Operators: The algorithm uses crossover and mutation operators. The im- 
plementation of those operators was as follows: 

- Crossover : To crossover two rules, a point is selected randomly along their lengths 
(at a corresponding location on both rules). The new rule, or the crossover result 
would be either the merger of the first part (before that point) of the first rule with 
the second part of the second rule, or the first part of the second rule with the second 
part of the first one. This choice was made using a 50%-50% distribution. 

- Mutation : In order to mutate a rule, each bit of the rule would be flipped with a 
probability p m , which is part of the algorithm’s parameter-set. 
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2.2 Solution Quality Assessment 

Investigating an algorithm immediately entails running it in various scenarios and com- 
paring the solutions produced by the algorithm. In this case this entails that it is necessary 
to have a method of comparing the evolved CA. Furthermore, the various CA that emerge 
during an evolutionary run need also to be compared and assessed. There is no guaranty 
that the best CA will be the one present at the last step. There exist several ways for 
defining such a CA fitness score (see [5], [8, page 82]), but here the average fitness of 
the CA’s cells will be used. This fitness assignment is easy to calculate from the data 
collected by the algorithm during its run and allows to save only the best CA. Note that 
this fitness does not necessarily depict the global demands of the given problem. This 
point will be further discussed in Section 4.1. 

3 Experiments 

As part of this study many experiments have been performed, designed to test the effects 
of the parameters (and some others criteria) on the learning capabilities of the algorithm. 
Only two of the results will be presented here. As a test case, we chose the Density Clas- 
sification problem in which the CA is required to detect whether the initial configuration 
has more then 50% 1 -values. If so, the entire CA has to be set to the value 1. Otherwise, 
all cells should have the value 0. This problem has been proven [10] to be unsolvable for 
a uniform CA: no uniform CA can classify all initial configurations correctly. However, 
this does not say anything regarding a non-uniform CA, nor how good a uniform CA 
can get (apart from never doing perfect). The initial configurations were generated in the 
following way: First, a desired density was drawn uniformly over [0,1] and then a con- 
figuration with such density was chosen randomly. This makes the problem somewhat 
easier since it spreads the density of the initial configuration uniformly over [0,1] while 
choosing an initial configuration completely at random would lead to a higher chance 
of it having a density in the neighborhood of 0.5, a region which have proven hard for 
CA. This problem has been investigated in detail by Mitchel et al., and Sipper [5, 7, 6,8]. 

4 Results 

Two aspects of the algorithm will be presented here. In the first part, three fit solutions 
are presented and analysed. The second part deals with an ’unlearning’ phenomenon 
discovered during the experiments. 

4.1 Solution Examples 

In this section, three examples of the best evolved solutions to the Density Classification 
problem are presented. Figure 1 presents and compares the solutions in several aspects. 
The first row of sub-figures shows the diversity of the rules among the three solutions. 
Each rule is colored uniquely such that a rule which is present in two separate solutions 
is colored in the same color in both. The second row of graphs shows the distribution 
of the rules present in each solution. On the x-axis the rules are ordered in an arbitrary 
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Fig. 1 . Three Density Classification solutions, evolved by the algorithm. 



order, which is maintained in the graphs in the third row. These graphs show, for each 
rule in the solution the average minimum and maximum hamming distance to other rules 
in the same solution. These quantities serve as a measure of how different the rules are. 
The fourth row presents a more detailed evaluation of the solutions. The graphs display 
the probability that the solution would compute a correct answer as a function of the 
density in the initial configuration. By correct it is meant that all cells of the CA display 
the correct answer (0 or 1). This was evaluated using 5000 initial configurations drawn 
at random and independently. Last, the solution’s average fitness is displayed, along 
with the solution’s true average quality. The true average quality of the solutions was 
evaluated using 5000 initial configuration as well. 

While being almost completely different, the three solutions have an almost equal 
average fitness. The left solution has the highest score of 0.9832 , followed by the middle 
one and 0.981 1 and the right one with 0.9796. However, looking at the true quality of the 
solution, or the probability that its output would be correct as defined by the problem, 
the situation is reversed. The best solution is now the right one, with a score of 0.929. 
Following is the middle one with a score of 0.8818 and last is the left solution with a 
score of 0.6734. The graphs showing the solution’s quality as a function of the initial 
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density show that the left most solution does not classify well densities larger then 0.5, 
resulting in its poor performance. These graphs also show that all solution perform 
poorly for densities in the range [0.4, 0.6]. This can be expected since in these domains 
the difference between the number of l’s and 0’s is small and therefore determining 
which value dominates is harder. 

Using two methods to evaluate solutions can be expected to give different results. 
The first method used is average fitness of the different rules, which is measured inde- 
pendently for each rule along 300 initial configurations. The second measures the entire 
CA, meaning that in order to achieve a good score all rules must display the correct 
output. Nevertheless, it was expected that the difference would not prove meaningful 
and that a solution which is deemed good by the average fitness score would also be good 
according to the full problem definition. The difference and its effect on correct classifi- 
cation put in doubt the choice of average fitness as a solution measurement. Differences 
such as the one between 0.98 and 0.67, are too large to be accepted. 

The distribution of rules, displayed in the second row of Figure 1 , suggests that higher 
rule diversity produces a better density classifier. The left most solution is dominated by 
a single rule, and performs the poorest. The right solution is the most diverse one and 
has the highest quality. 

While different, the rules are not far apart. The maximum hamming distance between 
two rules in all the solutions is 10 (out a maximum possible of 32) and this is reached 
only for two rules in the middle solution, which has only one instance each. Furthermore, 
rules that are more common tend to be more central. The averaged distance from those 
rules to others is relatively low. This is to be expected since the algorithm mutates good 
rules when they are copied to a different location. It follows that a set of rules which are 
close-together will result. 

4.2 A Look into the ‘Unlearning’ Phenomenon 

In several occasions during the experiments, it became clear that a good solu- 
tion can be evolved during a run, only to be ’unlearned’ or disappear. The good 

solution would gradually be replaced by a 
worse one, resulting in a poor quality solution. 
A clear example of this can be seen in Figure 2, 
where 10 runs of the algorithm are shown. 
As the figure suggest, ’unlearning’ is not rare, 
making our reproducible. Another point of in- 
terest is that the ’unlearning’ runs seem to 
converge to a fitness of around 0.79 (see Fig- 
ure 3 as well). In this section this ’unlearning’ 
phenomenon is presented in more detail. All 
experiments shown here relate to the Density 
Classification problem with the following parameters: N = 32, C = 300, p m = 0.001. 

One immediate question that arises is whether the unlearning is final: will the algo- 
rithm, after loosing the previous good solution, evolve a new one? This can be expected, 
since as the evolutionary run continues it is likely that a new good solution will emerge. 
In order to check this, several long evolutionary runs were performed, involving 10,000 
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Fig. 2. Unlearning phenomenon 
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evolutionary iterations. Figure 3 shows two such typical runs, one successful and the 
other displaying ’unlearning’. As can be seen, the ’unlearning’ is stable and final. Once 
a value of « 0.78 has been reached (after a peak of « 0.9) it remains roughly stable for 
the rest of the run. Note the sharp drop just before the thousandth iteration and the drops 
after 3500 and 7500 iterations. 

A different run where unlearning can be seen is presented in Figure 4. The first three 
figures display the average fitness and the number of instances of 18 rules. These rules 
had the most instances during the run 1 and are ordered according to this criteria. The 
rules are ordered 6 per graph, beginning with rules 1 to 6 , followed by 7-12 and 13-18. 
The rules are tagged with numbers resulting from reading the rule bit string as a binary 
number. Figure 4 also shows the average fitness of the solution, in similar fashion to 
previous experiments, to allow comparison and understanding of the run. In comparison, 
a successful run is displayed in Figure 5. 
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Fig. 3. Successful Learning and Unlearning - long evolutionary inns. 



As can be clearly seen in Figure 5, the unlearning beginning is accompanied by a 
sharp increase and decline in rule 250000192192. This rule first appears at time step 605, 
where the average CA fitness is fairly high, 0.955. In the next 35 steps, by time step 640, 
5 1 instances of this rule are present with an average rule fitness of 0.97. The rule continues 
to expand and reaches its peak in time step 662, where it has 171 instances : the maximum 
number of instances of a rule during the whole run. By this time the rule average fitness 
has dropped to 0.927 along with the average CA fitness (which is now 0.927 as well). 
At this point the rule begins to disappear, loosing instances while the average CA fitness 
continues to decrease. In time step 750, only 1 instance of this rule is present and its 
fitness is 0.867. Note the behavior of the second rule (186170228192), which is present 
in the period where the average CA fitness is high and disappears roughly when rule 
250000192192 reaches its peak. After the disappearance of rule 250000192192 no other 
rule is dominant and a mixture of rules is maintained. However, the average fitness does 
not rise again. These dominant patterns are notably different from the patterns displayed 
in the successful run, where rule diversity is kept along the run. 

This experiments are not sufficient to determine the cause of the unlearning phe- 
nomenon, nor to explain the complicated dynamics between rules during the evolu- 
tionary processes. However, it seems that the behavior seen in Section 4.1, where rule 

1 The 18 rules which had the best max value, where R l k denotes the number of instances 
rule i had at iteration k 
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diversity seemed to coincide with better density classifiers, reappears here. Once a rule 
managed to dominate the lattice, the quality of the CA decreases, causing a decrease 
in the fitness of the top rule and therefore allowing other rules to take over. The rule 
could only maintain its high fitness because of the presence of other rules in sufficient 
quantities. Being the best rule, it quickly multiplies reducing the number of other rule 
instances, to a point where it has diminished its ’source of power’ and it immediately 
starts to decrease. Since introduction and removal of rules is mainly done at the edges 
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of its continuous domain (inside evolution rarely occurs and if it does the rule would 
always survive crossover - the main creator of new rules), the process is gradual. Once 
the rule covered a large part of the lattice, several iterations are necessary for it to shrink 
back and perhaps disappear. 

This can explain the rise and fall of rule number 250000192192, but it does not 
explain why, after the rule disappeared, the evolutionary process does not manage to 
generate, once again, a good solution. Further, this might explain the relatively slow 
declines seen in Figure 3 but it does sufficiently explain the sharp drop and recovery 
just before the thousandth iteration nor the apparent convergence to a fitness sa 0.79. 
When using the Density Classification as a goal and fitness evaluation, the rules are 
highly interdependent. This fact makes analysis of rule interaction and understanding of 
the evolution process difficult. More experiments, and perhaps different measurement 
methods are needed. 

5 Conclusions 

This paper presents a study of an algorithm designed by M. Sipper [6,7,8] for program- 
ming non-uniform CA. The algorithm uses random examples (and their correct output) 
to evaluate fitness values of the CA rules and based on these, performs crossover and 
mutation operations in order to improve the current CA. The algorithm operates in a 
strictly local fashion, allowing easy parallelisation. 

The study investigated the effects of the algorithm parameters, using the 2D density 
classification problem as a test case. When applied to this specific problem and several 
other reported by Sipper, the algorithm was successful in producing highly fit solutions. 
Flowever, what is a fit solution and how it is defined has proven to be a key issue. One 
possible choice, and the one taken in this study and by Sipper, is to define CA fitness as 
the average fitness of its rules. This is a natural choice since it can be easily extracted 
from the fitness values of the rules, which are already calculated by the algorithm. 
Furthermore, since the algorithm is a genetic one, it is expected or, in some sense, it is 
the task of the algorithm to evolve a fit population 2 . However, due to the local nature of 
rule fitness evaluation, this has proven to be a problem. When correct problem output is 
defined in global terms, as in the density classification problem, it is not necessarily the 
case that a population which is in average locally fit will also be fit in a global sense. 
There is a large gap between the average fitness of a CA and its true global performance. 
Furthermore, these two measures are not correlated: CA with small rule diversity tend to 
be more fit but perform worse in absolute terms. CA using a richer set of rules are slightly 
less fit but perform significantly better. Since cellular automata are commonly used to 
produce global behavior, a general rule-fitness function, which is strongly connected to 
this notion, is needed. 

During the experiments it became apparent that the algorithm may sometimes loose a 
good solution and produce an inferior one. This phenomenon, named here ’unlearning’, 
was briefly adhered and seems to be caused by the interactions between the population 
members - the CA rules. The decrease in solution fitness coincides with an increase in 

2 The typical definition for the goal of a genetic algorithm is to find a highly fit solution, or 
individual. However, the algorithm is usually measured in terms of average population as well. 
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the relative representation of a single rule. The rule’s portion of the population increases 
until it has covered almost all cells, followed by a sharp decline. At the end of this 
decline the rule disappears completely and other rules come in its place. Given the 
apparent connection between rule diversity and the CA’s quality, this might explain the 
observed slow decrease in CA fitness. However, some of the other observed behavior is 
yet to be explained: sharp drops in fitness followed by and immediate recovery, apparent 
convergence to a specific fitness value and the algorithm’s failure to generate a new fit 
solution. In our experiments, ’unlearning’ was more common in the larger CA. This 
suggests that that ’unlearning’ is not caused by the finite size of the CA but further 
research on the effects of the size of the CA should performed. More experiments and 
new techniques are needed to study this phenomenon. 
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Abstract. Genetic Algorithms have been used before to evolve transi- 
tion rules for one dimensional Cellular Automata (CA) to solve e.g. the 
majority problem and investigate communication processes within such 
CA [3] . In this paper, the principle is extended to multi dimensional CA, 
and it is demonstrated how the approach evolves transition rules for the 
two dimensional case with a von Neumann neighborhood. In particular, 
the method is applied to the binary AND and XOR problems by us- 
ing the GA to optimize the corresponding rules. Moreover, it is shown 
how the approach can also be used for more general patterns, and there- 
fore how it can serve as a method for calibrating and designing CA for 
real-world applications. 

1 Introduction 

According to [6] Cellular Automata (CA) are mathematical idealisations of phys- 
ical systems in which space and time are discrete, and physical quantities take 
on a finite set of discrete values. The simplest CA is one dimensional and looks 
a bit like an array of ones and zeros of width N, where the first position of the 
array is linked to the last position. In other words, defining a row of positions 
C = {aq, <22, ..., djv} where C is a CA of width N and aw is adjacent to di. 

The neighborhood s n of a n is defined as the local set of positions with a 
distance to a n along the connected chain which is no more than a certain radius 
(r). This for instance means that S2 = {dus, di4g, di, 02, d3, d4, a.5} for r = 3 and 
N = 149. Please note that for one dimensional CA the size of the neighborhood 
is always equal to 2r + 1. 

The values in a CA can be altered all at the same time (synchronous) or 
at different times (asynchronous). Only synchronous CA are considered in this 
paper. In the synchronous approach at every timestep (t) every cell state in the 
CA is recalculated according to the states of the neighborhood using a certain 
transition rule 0 : {0, l} 2r+1 {0, 1}, s* — »• <9(sj). This rule basically is a one-to- 

one mapping that defines an output value for every possible set of input values, 
the input values being the ‘state’ of a neighborhood. The state of a n at time t 
is written as d( t , the state of s n at time t as and the state of the entire CA C 
at time t as (7* so that C° is the initial state and Vn = 1 ,N a (+ 1 = 0{s\ J. 
Given C* = {dj, ..., a* N }, C t+1 can be defined as {0(s*), ..., ©(s^)}. 
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Because a n £ {0, 1} the number of possible states of s n equals 2 2r+1 . Because 
all possible binary representations of m where 0 < m < 2 2r+1 can be mapped to 
a unique state of the neighborhood, O can be written as a row of ones and zeros 
R = {&i, 62 , b 2 2 r+i} where b m is the output value of the rule for the input 
state that maps to the binary representation of in — 1. A rule therefore has a 
length that equals 2 2r+1 and so there are 2 2 possible rules for a binary one 
dimensional CA. This is a huge number of possible rules (if r = 3 this sums up 
to about 3, 4 x 10 28 ) each with a different behaviour. 

One of the interesting things about these and other CA is that certain rules 
tend to exhibit organisational behaviour, independently of the initial state of 
the CA. This behaviour also demonstrates there is some form of communication 
going on in the CA over longer distances than the neighborhood allows directly. 
In [3] the authors examine if these simple CA are able to perform tasks that 
need positions in a CA to work together and use some form of communication. 
One problem where such a communication seems required in order to give a good 
answer is the Majority Problem (as described in section 3.1). A genetic algorithm 
is used to evolve rules for one dimensional CA that do a rather good job of solving 
the Majority Problem [3] and it is shown how these rules seem to send “particles” 
and communicate by using these particles [4] . These results imply that even very 
simple cells in one dimensional cellular automata can communicate and work 
together to form more complex and powerful behavior. 

It is not unthinkable that the capabilities of these one dimensional CA are 
restricted by the number of directions in which information can “travel” through 
a CA and that using multiple dimensions might remove these restriction and 
therefore improve performance. Evolving these rules for the Majority Problem 
for two dimensional CA using a Moore neighborhood (explained in section 3) is 
reported in [2] showing that the GA did not clearly outperform random search. 

The goal of the research is to find a generalization and report phenomena 
observed on a higher level, with the future goal to use this research for iden- 
tification and calibration of higher-dimensional CA applications to real world 
systems like parallel computing and modelling social and biological processes. 
The approach is described and results are reported on simple problems such 
as the Majority Problem, AND, XOR, extending into how it can be applied to 
pattern generation processes. 



2 The Genetic Algorithm 

As mentioned before, this research was inspired by earlier work [3,4] in which 
transition rules for one dimensional CA were evolved to solve the Majority Prob- 
lem (as defined in section 3.1). The GA is a fairly simple algorithm using bi- 
nary representation of the rules, mutation by bit inversion, truncation selection, 
and single-point crossover. The algorithm determined the fitness by testing the 
evolved rules on 100 random initial states. Every iteration the best 20% of the 
rules (the ‘elite’ rules) were copied to the next generation and the other 80% of 
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the rules were generated using single-point crossover with two randomly chosen 
‘elite’ rules and then mutated by flipping exactly 2 bits in the rule. 

To be able to compare two dimensional CA with one dimensional CA the 
GA used in section 3.1 is a copy of the the GA used in [3,4]- The GAs in section 
3.2 and 4 on the other hand are modified to fit the different problem demands, 
as will be explained in these sections. 

3 Experimental Results for Two Dimensional CA 

The two dimensional CA in this document are similar to the one dimensional 
CA discussed so far. Instead of a row of positions, C now consists of a grid of 
positions. The values are still only binary (0 or 1) and there still is only one 
transition rule for all the cells. The number of cells is still finite and therefore 
CA discussed here have a width, a height and borders. 

The big difference between one dimensional and two dimensional CA is the 
rule definition. The neighborhood of these rules is two dimensional, because there 
are not only neighbors left and right of a cell, but also up and down. That means 
that if r = 1, s n would consist of 5 positions, being the four directly adjacent 
plus a n . This neighborhood is often called “the von Neumann neighborhood” 
after its inventor. The other well known neighborhood expands the Neumann 
neighborhood with the four positions diagnally adjacent to a n and is called “the 
Moore neighborhood” also after its inventor. 

Rules are defined with the same rows of bits ( R ) as defined in the one dimen- 
sional case. For a von Neumann neighborhood a rule can be defined with 2 5 = 32 
bits and a rule for a Moore neighborhood needs 2 9 = 512 bits. This makes the 
Moore rule more powerful, for it has a bigger search space. Yet, this also means 
that searching in that space might take more time and finding anything might 
be a lot more difficult. In [2] the authors discourage the use of the Moore neigh- 
borhood, yet in section 3.2 and section 4 results clearly show successes using the 
Moore neighborhood, regardless of the langer search space. 

In a one dimensional CA the leftmost cell is connected to the rightmost 
cell. In the two dimensional CA this is also common such that it forms a torus 
structure. 



3.1 Majority Problem 

The Majority Problem can be defined as follows: Given a set A = {ai,...,a„} 
with n odd and a m € {0, 1} for all 1 < m < n, answer the question: ‘Are there 
more ones than zeros in A?’. 

The Majority Problem first does not seem to be a very difficult problem to 
solve. It seems only a matter of counting the ones in the set and then comparing 
them to the number of zeros. Yet when this problem is converted to the dimen- 
sions of a CA it becomes a lot more difficult. This is because the rule in a CA 
does not let a position look past its neighborhood and that is why the cells all 
have to work together and use some form of communication. 
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Fig. 1. This figure displays the number of rules that have a certain fitness value in 
the two dimensional experiment and compares this to the one dimensional experiment. 
The fitness bins are 0.01 in width and for both algorithm Fi 69 ,io 3 is calculated for 300 
rules. 
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Fig. 2. This figure shows a correct classification of the Majority Problem by a two 
dimensional CA with both width and height equal to 13 and A = 84/169. The transition 
rule was one of the best tested in the experiment and scored 1*169, io 3 — 0. 7 15. 



Given that the relative number of ones in C° is written as A, in a simple 
binary CA the Majority Problem can be defined as: Find a rule that, given an 
initial state of a CA with N odd and a finite number of iterations to run (I), 
will result in an ‘ all zero' state if A < 0.5 and an ‘all one’ state otherwise. 

The fitness (/) of a rule is is therefore defined as the relative number of 
correct answers to 100 randomly chosen initial states, where a ‘correct answer’ 
corresponds to an ‘all zero’ state if A < 0.5 and an ‘all one’ state otherwise. In [3] 
the authors found that using a uniform distribution over A for the initial states 
enhanced performance greatly; this is used here as well. The best runs will be 
tested using randomly chosen initial states with a normal distribution over the 
number of ones. The relative number of correct classifications on these states is 
written as F n ^ m where n is the width of the CA and m is the number of tests 
conducted. 

Preliminary experiments showed that it took much more time to evolve rules 
for the Moore neighborhood than for the von Neumann neighborhood. The tests 
that were done with the Moore neighborhood also did not result in any en- 
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couraging results, this being in line with [2]. That is why the von Neumann 
neighborhood was chosen for this experiment. Because this neighborhood con- 
sists of five positions, the search space for CA rules is a lot smaller than in the 
one dimensional experiment. Instead of the 2 7 = 128 bits in the rule, R now 
consists of 2 5 = 32 bits, thus drastically decreasing the serch space. This means 
that the search space decreased from 2 128 to 2 32 and is now 2^ 128-32 ) = 2 96 times 
smaller! 

For this experiment we used a CA with width = 13 and height = 13. This 
means that these CA have 13 x 13 = 169 cells (N) and are 169 — 149 = 20 cells 
larger than the one dimensional CA used in the original experiment. 

This algorithm was run 300 times and each winning rule was tested by calcu- 
lating Fjv,m using F’iggjo 3 - These results are plotted againts results of our own 
one dimensional experiment (not reported here, analogue to [3,4]) in Figure 1. 
The striking difference between this distribution of fitness and the distribution 
of fitness in the one dimensional experiment is the absence of the peak around 
Fn,m ~ 0.5 in the two dimensional results. In those results almost all the evolved 
rules have a fitness above 0.58. A fitness around 0.66 seems to be average and 
the best rules have a fitness above 0.7. That is all very surprising taking into 
account that the von Neumann neighborhood only consists of 5 cells. 

The Majority Problem is a good example of a problem that forces cells in 
a CA to ‘communicate’ with another. The communication ‘particles’ can be 
seen in the one dimensional experiment, but are not easily spotted in the two 
dimensional experiment. That does not mean there are no ‘particles’ traveling 
in the two dimensional CA, because it might be very hard to identify these 
particles. In a two dimensional CA ‘particles’ are no longer restricted to traveling 
in only one direction, but can travel to multiple directions at the same time. 
Traveling particles in two dimensional CA can therefore look like expanding 
areas with a distinct border. But there might be multiple particles traveling at 
the same time, meeting each other and thereby creating new particles. This is 
why communication between cells in a two dimensional CA is not very visible in 
the Majority Problem, although results show that this communication is present. 
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Fig. 3. This figure displays the iterations of a CA solving the AND problem. Every 
row shows the iteration of the rule using a different initial state. Note that in the first 
column ( t = 0) the initial states are clearly visible and in the last column the coloring 
matches the output of an AND port. 
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Table 1. Fitness values found in the AND problem. 



Fitness 


Number of runs 

Neumann Moore 


with crossover 


without crossover 


with crossover 


without crossover 


100 


0 


0 


31 


21 


98-99 


0 


0 


41 


54 


95-97 


0 


0 


14 


25 


90-94 


77 


93 


14 


0 


80-89 


23 


7 


0 


0 


70-79 


0 


0 


0 


0 


< 70 


0 


0 


0 


0 



3.2 AND and XOR Problem 

To show the communication between cells in a two dimensional CA a different 
experiment was conducted. A genetic algorithm was used to evolve rules for two 
dimensional CA that could solve the simple binary operators AND and XOR. 
These operators both have two input values and one output value which can only 
be determined if both input values are known. This is unlike the OR operator for 
example where the output value is always one if one or more of the input values 
is one, so if only one input value is known to be one then the value of the other 
input value is not needed. This may look very trivial, but it is very important 
in order to force the CA to combine the two values and thereby communicate. 

The AND Problem. To show the communications in a CA the information 
that needs to be combined must be initialized as far apart as possible. The 
following problem definition takes this into account: Given a square CA with 

two ‘ input cells’, one top left, and one bottom right: find a ride that iterates the 
CA so that after I iterations the CA is in an ‘all one’ state if both the ‘ input 
cells ’ were one in the initial state and in an ‘all zero ’ state otherwise. 

Small two dimensional CA were used with a width and a height of 5 cells 
and I was set to 10. The borders of the CA were unconnected to allow a larger 
virtual distance between the two corner cells. This means that the leftmost cell 
in a row was not connected to the rightmost cell in the same row and the topmost 
cell was not connected to the bottommost cell as was done with the Majority 
Problem experiment. Instead every cell on the border of the CA was connected 
to so called ‘zero-cells’. These ‘zero-cells’ stay zero whatever happens. 

When using two input cells, there are four different initial states. These states 
are written as where Vi and i >2 are the two input values. All cells other 

than the two input cells are initialized with zero. 

The fitness of a rule is defined as the total number of cells that have the 
correct values after / iterations. The number of ones in iteration t. is written as 
0* v i V2 y The total fitness of the AND problem is defined as / = (N — O 1 ^ 0 ^) + 
(N - O( 0tl) ) + (N - Ol m ) + Ol hl) . This makes the maximum fitness equal to 
4x5x5= 100. 
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In this experiment another variation of the simple genetic algorithm was 
used. A generation step starts by sorting the rules according to their fitness. 
Then it selects the top ten% of the rules as ‘elite’ rules and copies them without 
changes to the next generation. Every ‘elite’ rule is then copied nine times or 
is used in single-point crossover to make the other 90% of the population. Both 
methods were tested and compared. The generated rules are mutated and also 
moved to the next generation. Mutation is done by flipping every bit in the rule 
with a probability p m . The algorithm stops if it finds a rule with / = 100 or 
it reaches 1000 generations. In prelimenary experiments a number of different 
values of p m were tested. Setting p m to a rather high value of 0.05 turned out to 
be the most effective choice, confirming our insight that with increasing selection 
strength higher mutation rates than the usual j (/ beging the the length of the 
binary string) are performing better [1], 

The algorithm was run 100 runs with and without single-point crossover and 
using both the von Neumann and the Moore neighborhoods. The results are 
shown in Table 1. 

Although rules evolved with the von Neumann neighborhood are not able to 
solve the problem perfectly, it is already surprising that it finds rules which work 
for 93%, for such a rule only misplaces 7 cells in the final state. All the other 93 
cells have the right value. This suggests that the information was combined, but 
the rule could not fill or empty the whole square using the same logic. 

The Moore neighborhood is clearly more powerful and was able to solve the 
problem perfectly. The rules that are able to do this clearly show communi- 
cational behaviour in the form of “traveling” information and processing this 
information at points where information “particles” meet. 

It is also surprising that using crossover in combination with a Neumann 
neighborhood does not outperform the same algorithm without the crossover. 
This may be due to the order of the bits in the transition rule and their meaning. 
This is worth exploring in future work. Maybe using other forms of crossover 
might give better results in combination with multi dimensional CA. 
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Fig. 4. This figure displays the iterations of a CA solving the XOR problem. Every 
row shows the iteration of the rule using a different initial state. Note that in the first 
column ( t = 0) the initial states are clearly visible and in the last column the coloring 
matches the output of an XOR port. 
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Table 2. Fitness values found in the XOR problem. 



Fitness 


Number of runs 

Neumann Moore 


with crossover 


without crossover 


with crossover 


without crossover 


100 


0 


0 


0 
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98-99 
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95-97 
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6 


90-94 
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19 
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Fig. 5. The bitmaps used in the pattern generation experiment. 

The XOR Problem. The XOR Problem is not much different from the AND 
problem. We used the same genetic algorithm and the same CA setup. The only 
difference is the fitness function. We defined the XOR problem as follows: Given 
a square CA with two ‘ input cells’, one top left, and one bottom right: find a rule 
that iterates the CA so that after I iterations the CA is in an ‘all one’ state if 
only one of the ‘input cells’ was one in the initial state and in an ‘ all zero’ state 
otherwise. This means that the total fitness of the XOR problem is defined as 
f = (N - O[q >0) ) + Of 01) + O[ 1>0) + (N — 0[ M) ). 

The algorithm was run with p m = 0.05 for a maximum of 1000 generations 
for 100 runs with both Neumann and Moore neighborhoods with and without 
single point crossover. The results are shown in Table 2. 

These results support earlier finding in suggesting that single-point crossover 
doesn’t really improve the performance when used in a two dimensional CA. The 
results show that the algorithm using only mutation has found ways to solve this 
rather difficult communicational problem. The Neumann neighborhood seemed 
unable to perform for 100%, yet it came rather close with one rule classifying 
the problem for 92%. The algorithm found one transition rule using the Moore 
neighborhood that is able to solve the problem for the full 100%. This rule 
depicted in Figure 4 shows clear signs of “traveling particles” and is another 
example of how a local rule can trigger global behaviour. 

4 Evolving Bitmaps 

Now that it is shown that two dimensional CA’s can communicate, it is time to 
increase the challenge for the CA a bit. The aim of this experiment is to evolve 
rules for two dimensional CA that generate patterns (or bitmaps). 
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Table 3. Number of successful rules found per bitmap. 



Bitmap 


Successful rules 
(out of a 100) 


“square” 


80 


“hourglass” 


77 


“heart” 


35 


“smiley” 


7 


“letter” 


9 



UHHQ 

■ QT 

3D*SSTJ-TB5ffl 



Fig. 6. This figure shows some iteration paths of successful transition rules. 



The Bitmap Problem is defined as follows: Given an initial state and a specific 
desired end state: find a rule that iterates from the initial state to the desired state 
in less than I iterations. Note that this does not require the number of iteration 
between the initial and the desired state to be fixed. 

The CA used in this experiment is not very different from the one used in 
the AND/XOR experiment (section 3.2). In prelimenary experiments we tried 
different sizes of CA, but decided to concentrate on small square bitmaps with a 
width and a height of 5 cells (as done in section 3.2). To make the problem harder 
and to stay in line with earlier experiments the CA have unconnected borders 
like in section 3.2. The von Neumann neighborhood was chosen instead of the 
Moore neighborhood and therefore s n consist of 5 cells (r = 1) and a rule can 
be described with 2 5 = 32 bits. The searchspace therefore is 2 32 = 4294967296. 

After testing different initial states, the ‘single seed’ state was chosen and 
defined as the state in which all the positions in the CA are zero except the 
position (|_width/2j, |_height/2_|) which is one. For the GA we used the same al- 
gorithm as we used in the AND and XOR experiments. Because this experiment 
uses a Neumann neighborhood and the AND and XOR experiments suggested 
that the combination between the von Neumann neighborhood and single point 
crossover was not a good idea, this experiment used only mutation. Like in sec- 
tion 3.2 mutation is performed by flipping every bit in the rule with a probability 
p m . In this experiment p rn = 1/32 = 0.03125. 

In trying to be as diverse as possible five totally different bitmaps were chosen, 
they are shown in Figure 5. The algorithm was run 100 times for every bitmap 
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for a maximum of 5000 generations. The algorithm was able to find a rule for all 
the bitmaps, but some bitmaps seemed a bit more difficult than others. Table 3 
shows the number of successful rules for every bitmap. Note that symmetrical 
bitmaps seem to be easier to generate then asymmetric ones. 

Although this experiment is fairly simple, it does show that a GA can be used 
to evolve transition rules in two dimensional CA that are able to generate pat- 
terns even with a simple von Neumann neighborhood. Ongoing experiments with 
bigger CA suggest that they don’t differ much from these small ones, although 
the restrictions on what can be generated from a single-seed state using only a 
von Neumann neighborhood seem to be bigger when size of the CA increases. 

5 Conclusions 

This paper shows how two dimensional CA are able to solve the majority problem 
with similar results compared to one dimensional CA used in [3,4]. Using the 
same GA as in [3,4] a better average fitness was achieved suggesting that evolving 
two dimensional CA is easier and more reliable. 

The paper shows that two dimensional CA can show communicational be- 
haviour in the form of the AND and XOR. problems and that this behaviour can 
be evolved using a GA. The document also shows that a more generic behaviour 
can be evolved using a GA by showing how different patterns can be iterated 
from the same initial state. These results all suggest that a multi dimensional 
CA is a very powerful tool and in combination with GAs they can be evolved to 
exhibit specific bahaviour. It is therefore feasible that this combination can be 
used to solve all sorts of real world problems. 
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Abstract. Recently we have proposed a stochastic cellular automaton 
model of ants on a trail and investigated its unusual flow-density relation 
by using a mean field theory and computer simulations. In this paper, 
we study the model in detail by utilizing the analogy with the zero range 
process, which is known as one of the exactly solvable stochastic models. 

We show that our theory can quantitatively account for the unusual non- 
monotonic dependence of the average speed of the ants on their density 
for finite lattices with periodic boundary conditions. Moreover, we argue 
that the flow-density diagram exhibits a second order phase transition 
at the critial density only in a limiting case. 

1 Introduction 

There has been significant progress in modelling complex systems by using cel- 
lular automata (CA) [1,2]; such complex systems include, for example vehicular 
traffic [3] and biological system [4]. In most cases, particle-hopping CA models 
have been used widely to study the spatio-temporal organization in systems of 
interacting particles driven far from equilibrium[2,3]. In the traffic system, the 
vehicles are represented by particles while their mutual influence is captured by 
the inter-particle interactions. Usually, these inter-particle interactions tend to 
hinder their motions so that the average speed decreases monotonically with the 
increasing density of the particles. 

In a recent letter [5] we have reported a counter-example, motivated by the 
traffic-like flow of ants in a trail [6], where the average speed of the particles 
varies non-monotonically with their density because of the coupling of their 
dynamics with another dynamical variable. We have discussed the similarity 
and differences between our ant-trail model (ATM) and vehicular traffic. Our 
ATM model is a stoclratic CA model with two coupled variables, and reduces to 
the asymmetric exclusion process (ASEP)[7] in special limiting cases. 

In [8] we have developed a formalism by introducing the loose cluster ap- 
proximation and successfully captured the non-monotonicity lreuristically with 
the help of analytical results of ASEP, which is an exactly solvable stochastic 
model. In this paper, we study the phenomenon in detail by utilizing the analogy 
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Fig. 1 . Schematic view of the model for ASEP(a) and ZRP(b). The hopping probability 
of a particle depends on the gap size in front of it in ZRP, while it is constant in ASEP. 
In both cases, and the hopping to the occupied cell is prohibited. 



of ATM with the zero range process (ZRP) 1 , which is also known as one of the 
stochastic models that can be solved exactly [9, 10] (Fig. 1). 

The paper is organized as follows: ATM is defined in Section 2 and numerical 
results are shown in Section 3. Then we discuss the similarity between ATM and 
ZRP and give analytical calculations based on the analogy in Section 4, including 
the results in the case of the thermodynamical limit. Concluding discussions are 
given in Section 5. 



2 Ant Trail Model 

Let us first define ATM which is a simple model for an unidirectional motion of 
ants. The ants communicate with each other by dropping a chemical (generically 
called pheromone) on the substrate as they crawl forward [11,12]. The pheromone 
sticks to the substrate long enough for the other following sniffing ants to pick 
up its smell and follow the trail. 

Each site of our one-dimensional ant-trail model represents a cell that can 
accomodate at most one ant at a time (see Fig. 2). 

The lattice sites are labelled by the index i (i = 1, 2, ..., L); L being the length 
of the lattice. We associate two binary variables <S) and er,; with each site i where 
S-t takes the value 0 or 1 depending on whether the cell is empty or occupied by 
an ant. Similarly, cr, = 1 if the cell i contains pheromone; otherwise, ct* = 0. The 
instantaneous state (i.e., the configuration) of the system at any time is specified 
completely by the set ({5}, {<r}). 

Since a unidirectional motion is assumed, ants do not move backward. Their 
forward-hopping probability is higher if it smells pheromone ahead of it. The 
state of the system is updated at each time step in two stages. In stage I ants 
are allowed to move. Here the subset {S(£ + l)} at the time step £ + 1 is obtained 
using the full information ({5(£)}, {cr(£)}) at time t. Stage II corresponds to the 
evaporation of pheromone. Here only the subset {cr(£)} is updated so that at 
the end of stage II the new configuration ({S(£ + 1)}, {cr(£ + 1)}) at time £ + 1 
is obtained. In each stage the dynamical rules are applied in parallel to all ants 
and pheromones, respectively. 



We thank Martin Evans for drawing our attention to this analogy. 



l 






194 



K. Nishinari, A. Schadschneider, and D. Chowdhury 



q Q q 




ants 

pheromone 



ants 

pheromone 



if if if if 



S(t+1) 

c<4+l) 



X 



X 






J< 



ants 

pheromone 



Fig. 2. Schematic representation of typical configurations; it also illustrates the up- 
date procedure. Top: Configuration at time t, i.e. before stage I of the update. The 
non- vanishing hopping probabilities of the ants are also shown explicitly. Middle: Con- 
figuration after one possible realisation of stage I. Also indicated are the pheromones 
that may evaporate in stage II of the update scheme. Bottom: Configuration after one 
possible realization of stage II. Two pheromones have evaporated and one pheromone 
has been created due to the motion of an ant. 



Stage I: Motion of Ants 

An ant in cell i that has an empty cell in front of it, i.e., = 1 and Si+i{t) — 0, 

hops forward with 



probability 



Q if cr i+ i(t) = 1, 

q if <Ji+i(t) = 0, 



(1) 



where, to be consistent with real ant-trails, we assume q < Q. 



Stage II: Evaporation of Pheromones 

At each cell i occupied by an ant after stage I a pheromone will be created, i.e., 

<jj(t+l) = l if Si(t + 1) = 1. (2) 

On the other hand, any ‘free’ pheromone at a site i not occupied by an ant will 
evaporate with the probability / per unit time, i.e., if Si(t + 1) = 0, = 1, 

then 

, , f 0 with probability /, , . 

1 ' y 1 with probability 1 — /. 

Note that the dynamics conserves the number N of ants, but not the number 
of pheromones. The boundary condition used in this paper is periodic, that is, we 
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consider an ant trail of a circuit type. The case of the open boundary conditions 
will be studied in the successive paper [13]. 

The rules can be written in a compact form as the coupled equations 

Sj(t+ 1) = Sj(t) +mm(rij-i(t),Sj- 1 (t),l - Sj(t)) 

- mi - S j+1 (t)), (4) 

<Jj (t + 1) = max(6'j (t + 1) , min(<jj (t ) , (t))) , (5) 

where £ and ?y are stochastic variables defined by fj (t) = 0 with the probability / 
and £j(t) = 1 with 1 — /, and rjj(t) = 1 with the probability?) = q+(Q — q)(Jj+i(t) 
and r/j(t ) = 0 with 1 — p. We point out that eqs.(4) and (5) reduce to ASEP if 
we choose p as a constant, i.e., p does not depend on a. If we further consider 
the deterministic limit p = 1, then this model reduces to the Burgers CA[14], 
which is also known as an exactly solvable CA. 

3 Numerical Results 

The ASEP [7] with parallel updating has been used often as an extremely sim- 
ple model of vehicular traffic on single-lane highways [15]. The most important 
quantity of interest in the context of flow properties of the traffic models is 
the fundamental diagram, i.e., the flow-versus-density relation, where flow is the 
product of the density and the average speed. Thus it is interesting to draw the 
fundamental diagram of ATM to compare ants traffic with vehicle traffic. 

The fundamental diagrams of ATM is given in Fig. 3(a). The flow F and the 
average speed V of vehicles are related by the hydrodynamic relation F = pV . 
The density-dependence of the average speed of ATM is also shown in Fig. 3(b). 
We first know that the diagram does not possess particle-hole symmetry. In 
the ASEP the flow remains invariant under the interchange of p and 1 — p; 
this particle-hole symmetry leads to a fundamental diagram that is symmetrical 
about p = -f . In ATM, particle-hole symmetry is seen in the special cases of / = 0 
and / = 1 from Fig. 3(a). This is because in the two special cases / = 0 and 
/ = 1 the ant-trail model becomes identical to the ASEP with parallel updating 
corresponding to the effective hopping probabilities Q and q , respectively. 

Next, over a range of small values of /, it exhibits an anomalous behaviour 
in the sense that, unlike common vehicular traffic, V is not a monotonically de- 
creasing function of the density p (Fig. 3(b)). Instead a relatively sharp crossover 
can be observed where the speed increases with the density. In the usual form of 
the fundamental diagram (flow vs. density) this transition leads to the existence 
of an inflection point (Fig. 3(a)). 

By a detailed analysis of the spatio-temporal organizations in the steady- 
state, we were able to distinguish three different regimes of density. At low den- 
sities a loosely assembled cluster is formed and propagates with the probability 
q (Fig. 4). The leading ant in the cluster which hops with probability q will de- 
termine the velocity of the cluster. Homogeneous mean field theories fail in this 
region since these theories always assume that ants are uniformly distributed, 
which is not true in the case of ATM [8] . 
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Fig. 3. The average flow (a), speed (b) of the ants, extracted from computer simula- 
tion data, are plotted against their densities for the parameters Q = 0.75, q = 0.25 and 
L — 500. The discrete data points corresponding to / = 0.0005(0), 0.001(o), 0.005(»), 
0.01(A), 0.05(D), 0.10(x), 0.25(+), 0.50(*) have been obtained from computer simu- 
lations; the lines connecting these data points merely serve as the guide to the eye. 
The cases / = 0 and / = 1 are also displayed, which are identical to the ASEP cor- 
responding to the effective hopping probabilities Q and q, respectively. The analytical 
curve corresponds to / — > 0 in the thermodynamics limit, which is discussed in Sec. 4. 2 
is also depicted in (a) (the thick curve without ornaments). 

In the intermediate density regime, the leading ant occasionally hops with 
probability Q instead of q, because it becomes possible to feel the pheromone 
which is dropped by the last ant in the cluster. This arises from the fact that, 
because of the periodic boundary conditions, the gap size between the last and 
leading ant becomes shorter as the cluster becomes larger, so that the leading ant 
is likely to find the pheromone in front of it. This increase of the average speed in 
this intermediate-density region(Fig. 3(b)), leads to the anomalous fundamental 
diagram. 

Finally, at high densities, the mutual hindrance against the movements of 
the ants dominates the flow behaviour leading to a homogeneous state similar 
to that of the ASEP. In this regime we do not have the loose cluster any more 
and ants are uniformly distributed after a long time. Thus homogeneous mean 
field theories give a good result in accounting the fundamental diagram only in 
this regime. 

4 Analytical Results 
4.1 ZRP and ATM 

The ATM is closely related to the ZRP, which is known as one of the exactly 
solvable models of interacting Markov processes [9, 10]. The ZRP consists of the 
moving particles of the exclusion process. The hopping probability of a particle 
depends on the number of gaps to the particle in front. In the ATM, the hopping 
probability u can be expressed as 



u = 



9(1 - 9 ) + Qg, 



( 6 ) 
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Fig. 4. Spatial-temporal behaviours of loose clusters in the low density case ( p = 0.16). 
Parameters are Q = 0.75, q = 0.25,/ = 0.005). We see the loose clusters emerge from 
the random initial configuration, which will eventually merge after sufficiently long 
time into one big loose cluster. 

where g is the probability that there is a surviving pheromone on the first site 
of a gap. Assume that the gap size is x and the average velocity of ants is v. 
Since g(t- + 1) = (1 — f)g(t ) holds at each time step, we obtain g{x) = (1 — f) x / v 
after iterating it by x/v times, which is the time interval of between the passage 
of successive ants through any arbitrary site. Thus, in the ATM the hopping 
probability u is related to gaps x by [5] 

u{x) = q+{Q-q){l-fY'\ (7) 

Using the well known formal mapping between the ZRP and asymmetric exclu- 
sion processes [10] we conclude that the steady state of the ATM can well be 
described by that of the ZRP with parallel dynamics. 

By using the results of the ZRP, the average velocity v of ants is calculated 
by 

L-M 

V= ^2 u ( x )p ( x ) ( 8 ) 

X=1 

where L and M are the system size and the number of ants respectively (hence 
p = M/L is the density), and p(x) is the probability of finding a gap of size x, 
which is given by 



p(x) 



h(x ) 



Z(L-x-l,M-l) 
Z(L, M) 



(9) 



Since the ATM is formulated with parallel update, the form of h(x), as calculated 
in (9), is given by [16] 



h(x) 



1 - u( 1) 

-rr 1 - u(y) 

1 _ u ( x ) “A u (y ) 



for 

for 



x = 0 



x > 0 



(10) 
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Fig. 5. The fundamental diagram of the ATM with the parameter L = lOO(left) and 
L = 200(right). Parameters are Q = 0.75, q = 0.25,/ = 0.005. The smooth thin curve 
is the theoretical one, while the zigzaged thick one is the numerical data. 



The partition function Z is obtained by the recurrence relation 



L-M 

Z{L , M) = Y Z(L-x-l,M- l)h(x), (11) 

x=0 



with Z(x, 1) = h(x — 1) and Z(x, x) = h{ 0), which is easily obtained by (9) with 
the normalization Y1p{ x ) = 1- 

Next we draw the fundamental diagram of the ATM by using (8) with chang- 
ing p from 0 to 1. In drawing the diagram, we simply put the velocity v in (7) as 
v = q, which is known to be a good approximation for u [17] . In a strict sense v 
must be determined self-consistently by (7) and (8). Fundamental diagrams are 
given in Fig. 5 with L = 100 and L = 200. The thick black curve is the numerical 
data and the smooth thin black one is the theoretical curve in each figure with 
the specified value of L. We see that the theoretical curves are almost identical 
to the numerical ones, thus the ZRP is confirmed to describe the steady state of 
the ATM. 



4.2 Thermodynamic Limit of ATM 

Next we discuss the thermodynamic limit of the ATM, that is, the case L — > oo. 
From Fig. 5 we see that the curve shows sharp increase near the density region 
0.4 < p < 0.5, and the tendency is expected to be strong with the increase of L. 
Thus it is important to study whether there is a second order phase transition 
or not in the ATM in the thermodynamic limit. 

Following [16], the partition function Z(L,M) is written as 

Z(L,M)= Y, h(xi) ■ ■ -h(xM)S(xi + • • • + xm,L — M), (12) 
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where S(a, b) = 1 if a = b and S(a, b) = 0 otherwise. Considering p = M/L, (12) 
is rewritten by using the integral representation of the 8 function as 



Z(L, M) 



ds f G(s) \ M 
2nis Vs 1 /' 5-1 / ! 



where G(s) is the generating function of h defined by 



(13) 



G( s ) = '52 h (x)s x 



x=0 

By the saddle point method, in the L — »• oo limit we have 

Z{L, M) ~ exp(MlnG(z) — {L — M) In z) 
by keeping M/L = p as constant, where z is defined by 

1 1 9 In G(z) 

Then average velocity v is given in this limit as 

G(z) 



u(x)h(x) 
v = 2^ — 



(14) 



(15) 



(16) 



(17) 



X — l 



Next we study the properties of the generating function G(z) in detail to 
consider the phase transition of the ATM model. Since u(x) — > q (x — > oo) for 
/ > 0, the condition that G converges is given by 



lim sup 

x—>oo 



h(x + 1 )z 
h(x) 



1 -Q 



z < 1. 



That is, G converges in the range 

0 < z < z c = — . 

1 -q 



(18) 



(19) 



The critical value of G at z = z c is given by 



G(z c ) 



= l-u(l) + ^ 

X=1 



-rr f 1 - u (v ) g \ 

1_u ( a: )y=A u 1 ~ q ) 



(20) 



In the case / > 0, this sum diverges for the class of u(x) which decays to 
q more rapidly than the function q{ 1 + c/x), here c > 1 — q. In the ATM, 
u(x ) = q + (Q — q)( 1 — f) x ^ v , which decays exponentially to q as x — > oo. Then 
we conclude that there is no phase transition in the ATM for / > 0. This is 
because from (16), we have p = 1 when z = 0, and p = 0 at z = z c if G(z c ) 
diverges. Thus, so long as f > 0, in the entire density region 0 < p < 1 there is 
no singularity in G and, hence, no phase transition in the ATM. 
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The situation drastically changes when we take the limit / — > 0. In this case, 
we can set u(x ) = Q , and then G{z c ) becomes finite as 



lim G(z) 

Z — > — 2 C 



Q(i - g) 

<3(1 - q) - q( 1 - Q) 



(21) 



Then we have 
obtained from 



the phase transition in the case 
(16) as 



Pc = 



Q-q 

Q-q 2 ' 



f 



0 and the critical density is 



(22) 



Since u( x) —> Q in the case / — > 0, the average velocity at z = z c becomes 
from (17) 



v c 



E 



Q 

G(z c ) 



( 



l-o 

0 



(23) 



Substituting (21) into (23), we obtain v c = q. 

It should be noted [8] that (22) is also obtained by the intersection point of 
the line F = v c p and the ASEP curve [3] 



F=^(l- Vl-4Qp(l-p)) 



(24) 



in the flow-density diagram. The fundamental diagram in the thermodynamic 
limit is given in Fig. 3(a). Note that the limit L — > oo and / — > 0 do not 
commute[17]. If we take / — > 0 before L — > oo, then we apparently have (24), 
which is the case of our numerical simulations. If we take / — > 0 after L — > oo, 
then we have the thick curve in Fig. 3(a). Thus, in the latter case, we have proved 
that the anomalous variation of the average velocity with the density disappears. 



5 Concluding Discussions 

Several theoretical investigations have been carried out earlier, in terms of CA, 
to study the emergence of the trail patterns in ant colonies [4]. However, to our 
knowledge, our work is the first attempt to understand the traffic-like flow of 
ants on well formed trails using the language of CA. In this paper we have in- 
vestigated in detail the stochastic cellular automaton model of an ant trail [5]. 
The model is characterized by two coupled dynamical variables, representing the 
ants and the pheromone. The coupling leads to surprising results, especially an 
anomalous fundamental diagram which arises from an unusual non-monotonic 
variation of the average speed of the ants with their density on the trail in the 
intermediate regime of the ant density. We had earlier shown that the homoge- 
neous mean-field approximations are not able to capture the non-monotonicity 
quantitatively in the low density regime. In this paper we have reported our new 
quantitative results on this model; these results have been derived by utilizing 
the analogy with ZRP. Moreover we have shown that there is a phase transition 
in the thermodynamic limit in this model, albeit in a special limit. Extensions 
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of ATM, including counterflow and the case of open boundary conditions, will 
be reported in the future [13]. 

It would be interesting to test the predictions of the model, particularly the 
non-monotonic variation of average speed with density, by carrying out experi- 
ments on a circular ant-trail. 
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Abstract. A new software package, named Archirota, for simulating 
traffic in roundabouts is introduced. Its simulation module is entirely 
based on cellular automata and is automatically configured for real-world 
geocoded data. Archirota can be used both as a support for designing 
new roundabout and for modeling and simulating existing ones. Tests on 
actual use cases testified the effectiveness of the implemented approach. 



1 Introduction 

Among the many applications of cellular automata to complex system problems, 
the study of traffic is getting more and more popular in these very last years. 
The reason for that is the ease with which the dynamic features of traffic can 
be simulated with a cellular automata algorithm. With few simple rules the 
complexity of traffic (transition from free to congested traffic, lane inversion, 
stop and go traffic, nucleation of jams in correspondence of ramps etc.) can 
be reproduced in a way which is hardly achieved by models using differential 
equations ([13], [10], [5], [4]). Being computationally extremely efficient, large scale 
simulations can be accomplished with low cost computers and traffic forecasts 
can be envisaged. In order to simulate with cellular automata traffic on a road, 
the road is divided into cells whose occupation corresponds to the presence of a 
vehicle in that place at a certain time. A set of rules specify the time and space 
evolution of the system, that is how each vehicle moves along that road [6]. 

A slightly more complex problem is that of simulating a roundabout, that is 
a multiple (at least three) road junction in the form of a circle, around which all 
traffic has to pass in the same direction. Roundabouts were originally a particular 
feature of the British landscape, but in the last two decades they have widely 
spread all over the world. They replace crossroads, contributing to reduce car 
accidents and to slow down fast cars. Yet, they increase the flow of vehicles in the 
road because they tend to avoid the complete stop of cars which has as a direct 

* This work was partially supported by the Future & Emerging Technologies unit of 
the European Commission through Project BISON (IST-2001-38923). 
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effect the formation of jams. The accidents rate at roundabouts is reduced on 
average to one half of that of signaled junctions [2]. Finally, they offer lower cost 
and safer operation and are seen as a way to embellish settings with green and 
floral displays. The increasing number of roundabouts and their importance for 
the limitation of traffic jams and the reduction of accidents, fostered researchers 
into creating some instruments to help traffic engineers projecting roundabouts. 
Different softwares have been designed [3], showing different features and being 
currently used in many countries. 

In this article we describe a new software, named Archirota™, which can be 
used to simulate roundabout traffic. Its purpose is mainly to contribute to the 
proper design of new roundabouts and to the forecast of the traffic conditions 
which will be present in a given roundabout. A cellular automata algorithm is 
used in this software. The specific cellular automaton builds on the experience of 
the authors on cellular automata for environmental ([7], [8]) and specifically for 
traffic flow ([5]) simulation. Specifically, the algorithm is obtained essentially by 
modifying that used for modeling traffic on highways. Vehicles of different lengths 
occupy here from 1 to 9 cells (where each cell corresponds - parametrically - to 
2 m length), a feature which offers a wide dynamic range to the simulations. 
Vehicle speeds depend on the correspondence among cells and actual space. 
Cellular automaton evolution rules are accordingly defined. 

The Archirota™ software is written using the C++ language and its 
main features include: the capability to create the roundabout directly from 
a geocoded digital map; a variable number of roads and lanes per road; an 
average vehicle speed inside the roundabout depending on the radius of curva- 
ture of the roundabout; a variable insertion angle for each road; a dynamic and 
user-definable vehicle generation function and an easy modification of cellular 
automata rules. Finally, both a 2D and a 3D graphic interface are supported. 

Archirota™ has two output types: a visual simulation and a quantitative 
analysis of the traffic occurring at the roundabout. This includes: capacity of the 
roundabout, average crossing time for a vehicle along each path’; delay queuing, 
average speed, vehicle density and flow inside the roundabout. 

In the following we will first frame our work in current research on traffic 
simulation by means of cellular automata (2), then describe the essential features 
of the Archirota package (3) and finally show some use cases along with the 
output produced (4). 



2 Cellular Automata for Traffic Simulation 

In order to simulate traffic, we designed a cellular automata model which pro- 
vides a simple physical picture of a road system and is easily implemented on 
computers. Each road is divided into lanes and each lane is divided into cells, 
which can be either empty or occupied by a vehicle. Each lane starts at cell 
1; vehicles, which enter the road at its beginning move in discrete steps in the 
direction of increasing cell number. 
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A set of rules, described in details in reference [5], specify the time and 
space evolution of the system. These rules were derived from those originally 
developed by Schreckenberg and coworkers [11] and by Nagel and coworkers [12] 
to implement a multilane highway without ramps and with periodic boundary 
conditions. The rules specify at each step and for every car: a) the speed with 
which cars move along the road, b) how faster cars overtake slower ones and 
c) how cars slow down or accelerate according to the behavior of neighboring 
vehicles. Cars have a sufficient braking ability to avoid accidents. 

In each traffic simulation a starting configuration is chosen. New cars enter 
the road at a chosen rate as follows. For all on-ramps, at each iteration the 
program checks if the cell is empty. If empty, a random number is generated in the 
[0,1] interval and if it is less than the threshold chosen for that simulation, a car 
is generated in that cell. A number of local and average quantities are regularly 
recorded. At selected positions, the flow of vehicles and their average speed are 
collected. Regarding the flow, this is done counting the vehicles passing trough 
that location in a time interval divided by that amount of time. From these 
quantities, all other quantities of interest such as density (density =flow/speed) 
or the average distance between vehicles are computed. The same data are also 
collected as space averages: counting the number of vehicles present in a segment 
of the road at a given time provides the density. Their (average) speed allows to 
compute again the flow (flow= density x speed) and all other quantities. 

The use of averages is due first of all to the discreteness of cellular automata 
models. Cars in the model can only have as speed an integer number, while car 
speeds are real numbers in nature. Furthermore, experimental data are averages 
over time and/or space [9,13], so that a proper comparison again requires to 
average simulated data. As reported elsewhere [5], this model proved to be able 
to reproduce most of the experimental data available on traffic. 

Some additional rules must be introduced when simulating a roundabout. 
First of all a new object must be considered: the ring which connects the roads. 
Its on/off ramps are the end/beginning of the lanes connected by the ring itself. 
While in a simple road a vehicle disappears from the simulation at the road 
end, now it is passed on to the roundabout. When approaching the roundabout, 
a vehicle slows down and possibly stops because it has to give way to vehicles 
already inside the roundabout. A new rule is chosen for the speed limit of vehicles 
inside the roundabout so that this speed increases with the radius of the ring. 
Finally there is an impatient delay rule: car drivers at a road end waiting to enter 
the roundabout get progressively prone (up to a limit) to brake the respect of 
the rule defining the distance and approaching speed according to which they 
give way to vehicles already inside the roundabout. 



3 Archirota Specifications and Implementation 

The Archirota package was conceived to provide simulation support in actual 
roundabout design practice. Prospect users are decision makers who have to 
decide whether and how to construct a roundabout for a particular road junction. 
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Fig. 1 . The Ertha selection interface. 



The main program specification are therefore: 

— Possibility to work on actual digital city maps. 

— Possibility to include any relevant driver style. 

— Possibility to consider any relevant environmental influence. 

— Possibility to derive aggregate simulation data in the format decision makers 
are used to. 

— Possibility to easily define design alternatives and to support consequent 
what-if analysis. 

— Low cost and ease of use for unskilled practitioners. 

These specifications lead to the design of a very simple code, based on the 
cellular automata model introduced in section 2. The cellular automaton itself 
is of straightforward implementation, simply requiring to compute for each cell 
the next-iteration state on the basis of the current state of the cell itself and of 
those of its neighborhood. 

More precisely, following the notation introduced in section 2, we associated 
a geocoded cell vector Cj, j G J, to each entering and exiting lane and one 
to each lane of the roundabout itself. For each cell Cij, i € Ij of each vector 
J we then defined a cell-specific neighborhood N(ij), automatically including 
all cells which correspond to location that are visible for a driver occupying the 
lane position associated to ij. Visibilities are computed by simple computational 
geometric routines working on GIS-derived layers, such as buildings, green areas 
etc. 
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Fig. 2. The Archirota design environment. 

The simulation goes on by updating in parallel each cell state at each time 
instant. An outer loop, time instants, iteratively commands each cell to check 
its own state and that of its neighbors and to accordingly update its own next- 
time state. As mentioned, we have defined cell-specific neighborhoods, whereas 
the state domain is shared by all cells. Basically, each cell can be either free or 
occupied by a vehicle, in which case the type, portion and speed of the vehicle 
is recorded. The new state is simply a computation of whether, and by which 
vehicle, the cell will be occupied on the next time interval. The rules according 
to which vehicle positions are defined, as mentioned in section 2, are the same as 
those described in [5] except for the fact that the behavior of all visible vehicles 
is taken into account. 

Currently the architecture of the package requires the use of two different 
applications: a tailored GIS for selecting and exporting all relevant elements and 
the Archirota package itself. 

The GIS activities are currently supported by means of the desktop client of 
the Ertha GIS [1]. Ertha is an open source project currently under development 
at the University of Bologna aimed at providing source code for a multi-platform, 
multi-tier GIS. At the present date desktop, PDA and web browser clients are 
available, the former of which has been used for supporting the upload of the 
GIS project of interest, selecting all relevant layers and exporting the selected 
geocoded data. Figure 1 shows the interface of the tailored Ertha module. The 
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Fig. 3. Simulation of an existing roundabout. 



Fundamental Plot 




Fig. 4. Fundamental plot under light traffic conditions. 

exported layers are later uploaded by Archirota, which is the environment where 
the actual roundabout design takes place. In Archirota, vectors are automatically 
defined and dimensioned for each lane reported in the input file and all neighbor- 
hoods N(ij) are computed. Figure 2 shows the Archirota design environment. 
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Fig. 5. Fundamental plot under heavy traffic conditions. 
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Fig. 6. Fundamental plot under mixed traffic conditions. 



4 Computational Results and Use Cases 



Archirota was implemented in C++ and runs under windows. Although still 
under development, the main simulation features are by now stable and have 
been used to model different real-world cases. 

Figure 3 shows the main Archirota interface when a raster image of an ex- 
isting roundabout is loaded and a simulation is run on it. 

In this case we simulated the roundabout efficiency under two different traf- 
fic conditions. Specifically, we modified the incoming traffic rate for the roads 
from the left and from above. Figure 3 shows a snapshot of a congested traffic 
simulation case. 
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Quantified results are obtained directly from the cellular automaton by plac- 
ing ’’sensors” (i.e., counters) in user-definable roadside locations and recording 
traffic data for each sensor at each iteration. This data is later processed to ob- 
tain reports customized to user needs, even though some output is provided by 
default, which is the case for example of the traffic fundamental plot. 

Figure 4 shows the fundamental plot obtained under light traffic conditions, 
where traffic flows without developing significant queues. For this result we set a 
sensor on the road incoming from the left close to the entrance of the roundabout. 

Figure 5 shows the fundamental plot for the same roundabout of figure 4 
but with much increased incoming traffic. Now vehicle queues are permanent, 
and this is apparent from the plot since data points are not distributed along a 
line from the origin but are clustered in a region where a regression line has a 
definitely downward slope. 

Figure 6 finally shows again the same roundabout under mixed traffic con- 
ditions, this is a simulation of a time interval when for some minutes traffic was 
regular but at a certain moment it increases substantially. This could be the 
case for example of office or school closing hours. From the plot one can see how 
data point are initially on a line from the origin (regular traffic) and then move 
toward the congested area. 

5 Conclusions 

This paper presented the essential features of new a software package, named 
Archirota, for simulating traffic in roundabouts. The simulation module is ex- 
tremely simple and based on cellular automata. The neighborhood structure in- 
trinsic to cellular automata supports a great flexibility in simulation conditions 
and output data, thus an easy adaptation to different use cases. 

While the package interface still has to be improved, the simulation module 
and reporting facilities are by now stable and have been used to simulate real- 
world cases, both for designing new roundabouts and for modeling existing ones. 
In all cases the package results were adequate. 
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Abstract. Cellular Automata is applied to model the pedestrian flow, in which 
the local neighbor rules implemented to each person in the crowd are deter- 
mined automatically in the process of simulation. The collision patterns in the 
visible area settled around each person is introduced, and each person walks 
around just avoiding the collision patterns. As the simulation proceeds along 
the time step, various collision patterns are observed. By being informed the 
collision patterns to all the pedestrians simultaneously, the pedestrian begins to 
flow quite smoothly and the column formation is observed. 



1 Introduction 

Cellular Automata has been considered to be one of strong tools in modeling various 
kinds of phenomena such as pattern formation of natural system, fluid flow, traffic 
flow, city logistics or economical activities 1 ' 6 . Especially Cellular Automata is suitable 
in the simulation of the complex systems. Pedestrian flow is a typical example of the 
complex systems where one person walks toward the destination, paying attention to 
the people around him/her, and the pedestrian flow may be observed as assemblage of 
movement of each person. 

In the modeling by Cellular Automata, local neighbor rules and transition rules 
should be implemented to simulate time evolution of the phenomena to be consid- 
ered. But these rules have large effects on the simulation results, and careful consid- 
eration is required in defining these rules. In such cases as pedestrian flow, we have 
no standard rules describing the movement of each person in the crowd. 

In this paper, Cellular Automata has been applied to the simulation of pedestrian 
flow, in which local neighbor rules for a person to walk around in the crowd are de- 
termined automatically in the process of simulation. The notion of “collision pattern” 
is introduced in visible area settled around a person, and each person in the crowd is 
trained to avoid the collision patterns in the movement. Column formation along the 
pathway is also discussed. 
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2 Simulation Model and State Variables 



As an example model, a long and narrow pathway where people walk along in both 
directions has been simulated. The pathway is divided into square cells, and state 
variables are defined corresponding to “pathway floor”, “pedestrian”, “wall” and 
“entrance or exit”. For the cell whose state variables is “pedestrian” at a certain time 
step, “direction” and “velocity” are also defined as state variables. 

A part of simulation space is shown in Fig. 1 . A person walking from right to left is 
indicated as white circle, and from left to right as black one. A person may move in 
three directions; straightforward, forward to the right and to the left, as shown in 
Fig. 2. Each person on a pathway has four different maximum velocities, which are 
specific character walking along the pathway from the entrance to the exit. 

When the state variable is “pedestrian” on a certain cell, a visible area is settled 
around the cell depending on the direction as shown in Fig. 3. The direction to pro- 
ceed is chosen in reference to the state of each cell in the visible area. 

One side of the square cells is 1 m, and the whole pathway is divided into 17 x 102 
cells, which follows that the width of the pathway is 15 m, and the length 100 m. 20 
cells from the entrance or exit cells placed at both sides are assumed to the “approach 





Fig. 3. Visual area of each movement 
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cells”, where the pedestrian flow is not yet in the steady state. The data on the ap- 
proach cells are excluded in the analysis of simulation results. 

As the time step in the simulation is regarded to be 0.1 second in the actual world, 
the maximum velocities of people are set to 2.5, 1.4, 1.0 and 0.7 m/s, which are based 
on the measurement data. The difference of velocity is represented in the simulation 
so that a person can move one cell at every 4, 7, 10 and 14 time step. 

The pedestrian is set into the simulation space from the each entrance cell placed at 
both sides at the probability of 1 % at each time step according to random process. In 
the simulation, a person is prohibited to move to the cell where the state variable is 
already “pedestrian”, that is, each person cannot overlap each other on the same cell. 
In case when the cell where a person is going to move has already occupied by the 
other, the person should stop on the same cell. Further, when a person on the cell 
where the other are going to move has opposite direction, they are regarded to collide 
with each other. 



3 Acquisition of Local Neighbor Rules 

When no local neighbor rule is implemented to each person on the pathway for the 
movement, the pedestrian flow would not appear and people straggle along the path- 
way. One of the patterns of movement is shown in Fig.4. In such cases the collision 
of people can be often seen on the way. This is not the case we always see on the 
pathway, platform of stations or street. 

How to decide the local neighbor rules for pedestrian flow is very difficult, because 
we don’t have the standard rule to describe the movement of each people. Though we 
can, of course, implement any rules to the movement, they are regarded as reflecting 
the self-will of the engineer who makes algorithm of the simulation. 

The most obvious rule for pedestrian with which almost all the people may agree is 
just “escape from collision” to each other. In this sense, we would like to propose a 
new idea to make up the local neighbor rule automatically in the simulation process. 
In this algorithm, the people on the pathway can acquire the rules for movement 
automatically without being compelled to obey some given rules. The algorithm is 
described as follows. 



• — ► < — o 




Fig. 4. Example pattern of movement 
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(1) At first, all the pedestrians are flown into the pathway from the entrance cells 
without any rules. Each person often collides with each other on the pathway. 

(2) When a person collides with other person walking from the opposite direc- 
tion, the pattern of state variables of all the cells in visible area is memorized 
as the collision pattern. The pattern of the state variables to be memorized is 
characterized by the following three parameters; 

(a) existence of wall in visible area (yes:W, no:N) 

(b) the number of people in visible area walking in the same direction 

(c) the number of people in visible area walking in the opposite direction 
An example of the pattern is shown in Fig. 5. In this case, the pattern is indi- 
cated as “N0102”. This process is repeated up to 2000 time steps. 

(3) A person walking along the pathway sometimes collides with other person on 
the pattern characterized as the sequence of alphabet and numeric shown 
above, and sometimes walks smoothly enough without any collision. This is 
because the memorized pattern consists of mainly the number of specific per- 
sons in visible area, and has no information of exact position of other per- 
sons. Then, we introduce “collision probability” which is estimated from the 
simulation results repeated up to 2000 times as follows. 

“collision probability” = B / A 

A: total number of cases in which all the pedestrian face to this pattern 
B: total number of collision all the pedestrian make in this pattern 

In this estimation, the probability is regarded to be zero if A is smaller than 
100. The collision pattern and its probability are informed to all the pedestri- 
ans simultaneously. 

(4) A person on the pathway always pays attention to the condition of visible 
area, and tends to choose the pattern whose collision probability is minimum. 
If there are several patterns of the same collision probability, the person may 
choose at random. In case the collision probability of the pattern is zero, the 
person would continue to walk straight forward. 

(5) When collision occurs, back to (2), and then the process continues. 

The acquisition process of local neighbor rules described above may be equivalent to 
choose complementary patterns of the collision patterns. 
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Fig. 5. Example of collision pattern 
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4 Results and Discussions 

4.1 Typical Examples 

Typical examples of simulation results implementing the local neighbor rules de- 
scribed in the previous chapter are shown in Fig. 6. These are the selected scenes of 
animation at 2000, 4000, 6000, 8000 and 10000 time steps. As shown in the upper 
figures, a person on the pathway has no way to step aside to the other persons walk- 
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Fig. 6. Typical example of simulation 
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ing in opposite direction, and tends to collide with each other. As the persons on the 
pathway are trained to avoid the collision pattern or choose the pattern with minimum 
collision probability, the pedestrian flow is observed gradually, as shown in Figs. 6(d) 
and (e). 



4.2 Degree of Column Formation 

As shown in Fig. 6, the pedestrian flows in several columns along the pathway, which 
may be often seen in actual pedestrian flow around us. In this section, a quantitative 
study of column formation is described. 

The column formation is considered to be characterized by the number of columns 
and the number of people in each column. It is not easy to count the number of col- 
umn in each scene of animation, because the length and width of a column varies in 
each time step on the simulation. Then, in reference to Fig. 7, we estimate the number 
of columns and the average number of people in the columns. 

At first, a specific point is calculated on each row of cells. Each person walking 
from left to right corresponds to +1, and the person from right to left to -1. By adding 
the each point in every row of cells, the total point may be calculated for each row of 
cells. There may appear some boundaries of total points where the sign of point 
changes from positive to negative or vice versa. The number of columns in pedestrian 
flow may be characterized by this number of boundaries in the pathway. There are 
two columns in the example scene shown in Fig. 7. 

Next, by adding the absolute value of the specific point of all the row of cells and 
dividing the number of columns, the average number of people in columns is esti- 
mated. As an example shown in Fig. 7. the total absolute value is |+1| + |+5| + |+1| + |- 
1| + |-5| = 13. Then the average number of people in columns is 13/2 = 6.5. 
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Fig. 7. Number of columns in the example scene 
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Figure 8 shows the number of columns and the average number of people in col- 
umns calculated from the simulation results shown in Fig. 6. Though the number of 
columns varies in wide range at every time step of simulation up to 2000 or 3000 
time steps, it gradually converges from 4 to 6 in this case as the people are trained to 
avoid the collision. 



4.3 Collision Patterns and Their Probability 

The collision probability corresponding to each pattern is shown in Fig. 9. The pat- 
terns which are not observed in the simulation, or makes no collision are excluded 
from Fig. 9. The patterns whose collision probability is less than 0.1, are also ex- 
cluded. The variation of the number of collision which corresponds to the time steps 
in the simulation is shown in Fig. 10. This result is obtained by averaging the number 
of collision after the repetition of 10 times up to 10000 time steps simulation. During 
the time steps from 0 to 2000, the number of collision increases, because no training 
to avoid collision is implemented these interval. Just after 2000 time steps, the num- 
ber of collision increases suddenly, which is caused by the training algorithm where 
the number of data for collision is small and almost all the people on the pathway 
move straight forward. In these state, the typical flow pattern is almost same as the 
one shown in Fig. 6(b), where many people on the pathway cannot move due to colli- 
sion. 

As the training to avoid collision is repeated to all the pedestrian on the pathway, 
the number of collision decreased dramatically, and almost all the pedestrian walk 
along the pathway avoiding the collision around 8000 time steps. 
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Fig. 10. Average number of collisions in each 100 steps 



5 Conclusions 

In the present paper, the pedestrian flow on a pathway is simulated by Cellular Auto- 
mata, in which the local neighbor rules for the pedestrian are implemented automati- 
cally. The rule given to each person on the pathway is just to avoid the collision with 
each other, and this rule may be the way to go through a pathway how we have been 
all trained from childhood. The collision patterns of state variables in visible area are 
memorized and are informed to all the people so that they may avoid the collision 
patterns. These local neighbor rules implemented to the pedestrian may be considered 
as the complementary rules of the collision rules. 
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As a lot of previous authors pointed out. Cellular Automata is a strong tool in 
modeling various kinds of phenomena. But the local neighbor rules implemented in 
the simulation have definitely large effects on the results. Though in modeling a 
physical phenomenon we may introduce the rules based on some physical lows, it is 
difficult to choose the rules in modeling such cases as pedestrian flow. We have no 
standard rule except for avoiding the collision with each other. In this sense, the algo- 
rithm proposed in this paper is one of strong tools to define the local neighbor rules 
for the “agents” to move on the simulation space autonomously. 
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Abstract. Although diesel engines have an advantage of low fuel consumption 
in comparison with gasoline engines, several problems must be solved. One of 
the major concerns is that diesel exhaust gas has more particle matters (PM) in- 
cluding soot, which is suspected to be linked to human carcinogen. As one of 
the key technologies, a diesel particle filter (DPF) has been developed to reduce 
particle matters (PM). Since it is difficult to examine the local flow or tem- 
perature field in porous media experimentally, we try to simulate combustion 
flow. Two-phase Automaton is applied to produce the porous structure based 
on these particle behaviors. The porosity and tortuosity are freely changed by 
the profiles of the two-phase interface. Combustion field is simulated in this 
porous media flow, assuming that soot is attached to the wall surface. Results 
show that the soot is burned through the reaction with oxygen, similar to the 
regeneration process of DPF. 



1 Introduction 

Diesel engines have an advantage of low fuel consumption in comparison with gaso- 
line engines. However, several problems must be solved. One of the major concerns 
is that diesel exhaust gas has more particle matters (PM) including soot, which are 
suspected to be linked to human carcinogen. For this reason, more strict exhaust 
emissions standards such as Euro V in 2008 will be set in many countries. In Japan, 
the Tokyo municipal government has begun to regulate diesel-powered commercial 
vehicles that fail to meet the new emission standards. 

As one of the key technologies, a diesel particle filter (DPF) has been developed to 
reduce particle matters (PM) in the after-treatment of exhaust gas. Some filters have 
been introduced at the Frankfurt Motor Show in 2003. This can be applied to satisfy 
more strict regulations for diesel emissions, coupled with improvements of combus- 
tion conditions. In simple explanation of DPF, it traps the particles when exhaust gas 
passes its porous structure. Since the filter wall would readily be plugged with parti- 
cles in a short time, the accumulated particles must be removed. Usually, the filter is 
heated to burn the particles in combustion. It is called filter regeneration process. 
However, it is expected that the outer heating with the particle burning may destroy 
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the porous structure. Then, the thermal durable filter is plausible to maintain the low 
level of emissions in the long term [1]. So far, the filter has been developed mainly by 
experiments, and there may not be enough information to observe and understand the 
phenomena in DPF. For better design of DPF with efficiency and durability, it is 
necessary to conduct simulation with combustion in porous media. In conventional 
computational code, it is very challenging to deal with this process, because we need 
to consider the complex geometry with chemical reaction. 

Here, we try to simulate the combustion flow in porous media. The porous inner 
structure is very complex, which is normally determined by many molecular behav- 
iors when the media is produced or manufactured. However, since the hydrodynamic 
equations for these complex processes are either unknown or ill-posed, conventional 
CFD code can not be applied easily [2], Molecular dynamics (MD) simulation is not 
practical, because it is impossible to consider the real system due to the huge calcula- 
tion time and luck of memory. Then, to produce the porous structure, we focus on the 
Lattice Gas Automata (LGA), which is a kind of cellular automata for fluid simula- 
tion. LGA is an idealization of a physical system in which space and time are discrete 
and the physical quantities (i.e. the states of automaton) take only a finite set of val- 
ues. It should be noted that sufficiently large cellular automata often show seemingly 
continuous macroscopic behavior. Frisch et al. [3,4] have proposed a model, where 
hexagonal lattice is used to succeed in obtaining the correct Navier-Stokes equation 
by starting from the lattice gas automata. So far, this method has been applied to a 
large range of scientific problems including diffusion processes and wave propaga- 
tion. 

In this study, we use immiscible lattice gas (ILG) model, which is applied for two- 
phase flow [5], When we consider the two-phase interface as porous wall structure, 
we freely change the porosity and tortuosity of porous media [6,7]. Then, we simulate 
combustion flow in any porous structure to investigate directly the local heat and 
mass transfer in DPF. The combustion simulation is conducted by the Lattice Boltz- 
mann method (LBM), which is independent of LG simulation to produce porous 
structure. 



2 Numerical Model 

2.1 Porous Structure by Immiscible Lattice Gas 

Immiscible Lattice Gas (ILG) model for two-phase is used to produce the porous 
structure. It consists of a hexagonal (triangular) lattice with particles residing on the 
node in two-dimensional simulation (see Fig. 1). Two sets of Boolean variables, rfx, 
f)and b(x, tjfor red and blue particles respectively, are defined to describe the particle 
presence or absence for each phase. The lattice with unit lattice space, where there are 
six directions at each node, i = I to 6, are 
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( 7 = 1 , •••, 6 ), 



where e is the local velocity vector, and each particle has the same mass and velocity 
of unity. Properties such as density and velocity are determined by the collective 
behavior of many particles. As shown in Fig. 1, the local averaging is conducted in 
sub-domain to obtain the macroscopic quantities. 

Starting from the initial state, the configuration of red and blue particles evolves in 
two sequential sub-steps of collision and propagation. The first step is collision. 



r j (x,t*) = r j (x,t) + R i (r(x,t),b(x,t)\ (7 = 1 , •••, 6 ) 

b i (x/) = b i (x,t) + B i (r(x,t),b(x,t)), (7 = 1 , •••, 6 ) 



where, R t and /i is the collision operator. This collision occurs just after particles 
arrive at a node at t = t . They interact and change their directions according to scat- 
tering rules, which preserves mass and momentum. The collision rules for each phase 
are listed in Ref. [4]. However, in case of collision between red and blue particles, 
one unique collision is selected so that particles are preferentially moving toward 
concentrations of same color and away from concentrations of different color, result- 
ing in phase separation [4,8]. Figure 2 illustrates the example where one red particle 
collides with one blue particle, showing the particle motion before and after collision. 
Red and blue squares represent the regions of high concentration of each particle. 

At the next time step (t = t +1), each particle moves to the nearest node in the di- 
rection of its velocity, which is called propagation. 



r i (x+e i ,t + \) = r i (x,t*), (7 = 1 , •••, 6 ). 

b i (x + e i ,t + l) = b i (x/), (7 = 1 , •••, 6 ). 



Thus, the evolution equation of ILG model is as follows: 



n (x+e t ,t+ 1 ) = r (x, t) + R t (r(x, t),b(x, tj), (7 = 1 , •••, 6 ). 

/) (x+e i ,t+l) = lr (x, t) + B t (r(x, t),b(x , /)), (7 = 1 , •••, 6 ). 

The total particle density and momentum are defined and given by 

p = Y J (b +b ih 

i 

pn = Y j e i (r i +b i ). 
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As already mentioned, physical quantities are obtained by averaging the particle 
motion in sub-domain. This procedure is needed to eliminate statistical noise. When 
we take the large sub-domain, the spatial resolution becomes worse. However, there 
are the large fluctuation exists when the sub-domain size is too small. Here, the sub- 
domain of 16x16 nodes is used to obtain velocities and density through averaging 
procedure. The total computational domain has 2,124,081 grid points (3233x657). 
There are 101x41 sub-domains to determine the physical quantities. As for the 
boundary conditions, the periodic boundary is adopted both horizontally and verti- 
cally in this 2D simulation. The initial position of red and blue particles is random, 
and the velocity field is almost stationary. The particle number ratio of each phase is 
given to set the porosity. Although the two-phase flow can be simulated by LBM (e.g. 
particle-fluid suspensions [9]), LG simulation is performed in this study, because the 
rule for the two-phase interaction is very simple compared with LB equation. 




Fig. 1 . Calculation region and hexagonal lattice for LGA 





t=t 



t=t* 



Fig. 2. Collision between red and blue particles in ILG model 
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2.2 Combustion Simulation by Lattice Boltzmann Method 



As for the combustion simulation, we have used the Lattice Boltzmann Method 
(LBM) [10,1 1]. For example, the LB equation for temperature is, 



F T,cX x + e a S ,’ t + $ ,)~ F tA X ’ f )= l F T,A X ’ - F T q A X ’ ‘)] + W aQT 



where F T is the distribution function for temperature, x T is the relaxation time deter- 
mined by thermal diffusivity, and w 0 = 4/9, w a = 1/9 (a = 1:4), w a = 1/36 (a = 5:8) for 
9-bit lattice BGK model in two-dimensional square lattice space. The source term, Q T , 
is heat release by chemical reaction. As for the soot combustion, we have already 
tested the real sample of Ni-Cr metal for benchmark study in porous media flow [12], 
Here, we apply the same scheme to the porous media formed by ILG model. It is 
assumed that the soot is homogeneously attached to the wall surface, which is similar 
situation in the regeneration process of DPF. The over-all reaction by Lee et al. is 
adopted for soot oxidation [13]. For simplicity, any catalytic effect is not considered. 
More information on the soot oxidation is found in the review article by Stanmore et 
al. [14], In the simulation, the oxygen concentration in inflow is changed. The oxygen 
concentration is an important parameter in engine operation, because it is easy to 
control the oxygen concentration by using the exhaust gas recirculation (EGR) [15], 
in order to prevent an abrupt temperature rise from damaging the local structure of 
DPF. 

Figure 3 shows the coordinate and boundary conditions. The calculation domain is 
5 cm x 1 cm, and the inflow velocity, U jn , is 20 cm/s. The total number of grids is 
201 X41, with grid size of 0.25 mm. The porous structure of 100 grid length ( L ) is 
placed in the center part in this figure. In the calculation, all equations are non- 
dimensionalized based on similarity to obtain the temperature and concentration 
fields. As for the boundary condition, the inflow boundary is adopted at the inlet [16]. 
The temperature and mass fractions are those of the air at room temperature. At the 
sidewall, the slip boundary conditions are adopted, considering the symmetry [17]. At 
the outlet, the pressure is constant, and the gradient of scalar such as temperature is set 
to be zero. On the surface of the porous wall, the bounceback rule for non-slip bound- 
ary condition is adopted. 



Slip B.C. 




Slip B.C. 

Fig. 3. Coordinate and boundary conditions in porous media flow 
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3 Results and Discussion 

3.1 Porous Structure 

Figure 4 shows time evolution of two-phase automaton. The fraction of red particle is 
shown. The total density ratio of blue particles to red particles is about three, corre- 
sponding to the porosity of 0.75. Time steps are 50, 1000, 10000, and 40000, respec- 
tively. The initial position of each particle is random. The average density per node is 
about 1.5. As seen in this figure, the phase separation is observed. At later time step, 
the red spot colligate each other to become larger spot. Resultantly, the interface 
surface is decreased monotonically. The final equilibrium state is full separation with 
a few large red spots. 

Here, we consider the blue phase as volume available for flow and red phase as 
solid media. The interface of two-phases is regarded as porous wall. The key pa- 
rameters of porous structure are porosity and total wetted surface. The pore size dis- 
tribution may be important, but the porosity and total wetted surface are used in the 
theoretical formula to describe the porous structure [18]. These two parameters are 
shown in Fig. 5 to see the time dependence. The wetted surface, S, is non- 
dimensionalized by the porous region length of L. In 2D calculation, the wetted sur- 
face corresponds to the wetted perimeter. It is clearly shows that only the wetted 
surface is decreased as the time step goes. Then, by this model, we can choose any 
wetted surface of porous media with porosity constant. 

Then, we simulate the flow field in created porous structure. We choose three 
structures with porosity of 0.75, which are cases 1, 2, and 3. The wetted surface of 
porous media is smaller from cases 1 to 3. These are based on two-phase profiles at 
the time steps of 1000, 7000, and 30000. Figure 6 shows the distribution of non- 
dimensional velocity in the .r-direction, u x /U in . It is found that the flow is fluctuated in 
the porous region and the flow direction is largely changed, even when the flow is 




Fig. 4. Time evolution of two-phase automaton, illustrated at representative time steps 
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Fig. 5. Porosity and total wetted surface as a function of time step 




-0.3 3 9 



Fig. 6. Flow field in porous media of (a) case 1, (b) case 2, and (c) case 3. 



uniform at the inlet. The velocity is locally accelerated in the narrow path, since the 
obstacle is randomly placed. As a whole, the flow is smooth in case 1 . In cases 2 and 
3, the negative velocity is observed, which means that the recirculation flow exists. 
Therefore, with different porous structure, the flow pattern is changed at the same 
porosity. 

The pressure distribution is also examined for the above three cases. Figure 7 
shows the averaged pressure distribution along y-direction, which is normalized by 
the constant pressure at the outlet. In the porous region shown by length of L, the 
pressure starts to decrease almost linearly, although it slightly fluctuates in cases 2 
and 3. After passing this region, the pressure is almost constant. To compare three 
cases, it is found that the pressure at the inlet is increased as the larger wetted surface, 
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which could be due to the larger effect of shear forces. In future study, our numerical 
scheme will be extended to produce 3D porous structure to consider the real porous 
media. Although the simulation is conducted only in 2D geometry, it is demonstrated 
that the different flow is observed in the produced porous media by ILG. 




3.2 Combustion Field 

Next, the combustion flow is simulated. It is assumed that the soot is homogeneously 
attached to the wall surface with its mass fraction of 0. 1 . The mass fraction of oxygen 
in the air is 0.233. When the porous wall is heated, the soot is burned to react with 
oxygen. Time, t, is counted after we set the wall temperature for 1200 K. Figure 8 
shows the distributions for temperature of /', mass fraction of oxygen and soot of Y m 
and Y c , and the soot reaction rate of W c . The temperature and reaction rate are non- 
dimensionalized by inflow temperature and maximum reaction rate. Time step is 
5000 after heating, and the porous structure is that of case 2 in Fig. 6. It is found that 
the oxygen concentration around the wall surface is decreased by the reaction with 
soot. Then, the maximum temperature is higher than the wall temperature. It is inter- 
esting to note that the reaction rate locally varies, simply because the oxygen trans- 
port is different by the convection. 

Moreover, we change the oxygen concentration in inflow, which is an important 
parameter to control the reaction rate. Here, nitrogen is added in the air. We set the 
oxygen concentration (mass fraction) to be 0.233 (air) and 0.1. The total mass of soot 
is same for both cases. As seen in Fig. 8, the reaction rate is locally changed. To make 
clear the effect of oxygen concentration, the reaction rate is averaged at the wall sur- 
face. Results are shown in Fig. 9. The reaction rate is non-dimensionalized by the 
peak value in case of air. It is found that, as the oxygen concentration is lower, the 
mean reaction rate is decreased. Then, the reaction of soot is completed at the earlier 
stage, compared with dilution case. This is very reasonable [12]. We conclude that the 
combustion simulation in porous media is well demonstrated. 
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Fig. 8. Distributions of temperature, mass fraction of oxygen and soot, and reaction rate 
porous media of case 2 




Fig. 9. Mean reaction rate of soot as a function of time after heating 
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4 Conclusions 

Using two-phase automaton of ILG model, we have successfully obtained the porous 
structure. Setting the ratio of red and blue particles, the porosity is easily given. The 
tortuosity of the media is determined by the degree of two-phase separation. Then, the 
flow characteristics are freely changed by choosing the porosity and wetted surface. It 
has been demonstrated that, using the LBM, the flow is simulated in different porous 
media structure. For simplicity, the simulation has been conducted only in 2D ge- 
ometry, but this idea to produce the porous media structure can be extended to 3D 
simulation. Also, the soot combustion has been well simulated. For example, there are 
regions where the temperature is locally higher. The effects of oxygen on the reaction 
rate are clearly recognized. This information is indispensable for the better design of 
DPF. 
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Abstract. In this paper a new cellular automata model suitable for 
granular systems simulation is presented. The proposed model is shown 
to be equivalent to a particularization of the well known BCRE model 
of granular systems and a correspondence between the parameters of the 
presented model and the BCRE model is also set, allowing to fit these 
parameters for a given system. The model has the advantage over other 
cellular automata models of being more realistic in the behavior of the 
surface of heaps and slopes. The dynamics of the CA is analyzed in order 
to confirm that it also has one of the most important features of these 
systems, 1// noise. 



1 Introduction 

Granular systems behavior has been widely studied during the last decades, due 
to its many applications to industry. They show a complex behavior that cannot 
be easily classified in one of the usual matter states; solid, gas or fluid. 

The traditional approach for studying its dynamics has been based in flu- 
ids models, describing the flow of granular systems and the formation of heaps. 
However granular systems show characteristics, such as the appearance of macro- 
scopic patterns or avalanches, that cannot be properly described using this ap- 
proach. By the other side, Cellular Automata (CA) have also been used for 
granular systems modelling. Several works have been published that use CA 
models to study the statistical properties of granular systems [1,2,3]. However 
the formation of heaps and the way the size of a pile of matter evolves is better 
described by fluid models. 

When simulating the behavior of a granular system in a computer graphics 
application, the most relevant part is the visualization of the evolution of the 
external surface of the pile along time [4] . Models based on fluid dynamics do not 
suit quite well to such applications, since their computational cost makes them 
difficult to apply to these kind of real-time simulations. In opposite, CA based 
models describe the granular system as a grid, therefore, as has been shown in 
[5], they can be solved very efficiently. 

In previous works [5,6] some of the CA models found in the bibliography [1,7] 
were reviewed in order to obtain a more realistic heap and slope formation. The 
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main problem of these models is their anisotropy, which is specially notable when 
a heap is formed by the addition of matter in a single point [6]. The resulting 
heap has polygonal section, instead of being conic. 

Thus, on the one hand continuous models are not suitable for real-time sim- 
ulation. On the other hand, CA-based models fit well the requirements of this 
kind of application, but do not show a behavior realistic enough. The objective 
of this work will be to obtain a CA-based model with a behavior somehow equiv- 
alent to that of continuous models. This model fills the need of a realistic and 
computationally inexpensive model. 

In order to achieve this, a model that is shown to be locally equivalent to 
an existent continuous model is proposed. The model presented by Bouchaud, 
Cates, Prakaslr and Edwards [8] (also known as BCRE model) will be used as 
the starting point, because of its simplicity and its intuitive description. 

The BCRE Model. A granular system can be considered as a system with 
two layers; the standing layer, that forms the slope or the heap of the system, 
and the rolling layer, that is a thin layer that flows on the surface of the slope 
[9]. The BCRE model [8] uses this schema and provides a set of equations to 
describe the evolution of these layers’ thickness. The model can be formulated 
in one dimension as follows: the system has two state variables, the height of the 
static layer, s(x,t) and the height of the rolling layer, The variation of 

these variables along time is expressed by the set of equations 

r t = vr x - j(a - |s x |)r (1) 

s t = 7 (a - |s x |)r 

where a is the so called angle of repose of the system, 7 is a parameter that 
expresses the rate of matter transfer between layers and v is the speed of the 
rolling layer that is considered constant. 

Equations (1) express that the thickness of both layers will depend on two 
major processes: on the one hand the transfer of matter from the rolling layer 
to the resting layer or vice-versa, and on the other hand the exchange of matter 
at each point of the rolling layer with its surroundings. 

This model was extended to 2 dimensions [10] leading to a set of partial 
differential equations on the variables r(x,y,t) and s(x,y,t ) 

ry = t>V(rVs) — 7 (a — |Vs|)r (2) 

St = 7( a “ |Vs|)r 

using the same notation as in equations (1). Equations (2) define the 2- 
dimensional continuous model that will be used later in this paper. 

Following, the update rule that defines the new model will be presented, 
and the relation between this new model and the BCRE will be explained. In 
the third section, some properties of the new model will be shown in order to 
demonstrate that, despite the goal of this work, the main statistical property of 
most CA models is kept. 
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2 Description of the Model 

In order to define an update rule for a CA model that reflects the behavior of 
a continuous surface, some simplifications to the two layer description of the 
granular system that was seen before have to be done. 

To define the new model we will work under two main assumptions. The first 
one is that the thickness of the rolling layer is constant, being h = r + s the 
height of the system at every point. The second assumption is that the surface 
of the granular system h(x, y) is a two times differentiable function. 

A CA on an TV x N square grid is considered. This grid represents the plane 
on which the granular system is laying. The value of each cell h(i,j) £ 1R will 
represent the height of the system on the cell’s center (ay, y ? ). The set of points 
{(xi,yj, h(i, j))}ij is a discretization of the surface {(x,y,h(x,y))}. Notice that, 
although the update rule is defined using the derivatives of h(x, y), an expression 
in terms of the CA variables can be easily obtained. 

The update rule is as follows. For each cell, (i,j), the gradient Vh(xi,yj ) 
is computed. If the angle of the slope arctan(|V/i|) is lower than the repose 
angle threshold a, the value of cell (i. j) remains unchanged. On the other case, 
if arctan(|V/i|) > a, the height h(i,j) is reduced z + ■ (h x + h y ), h(i + 1 ,j) is 
increased z + ■ h x and h(i,j + 1) is increased z + ■ h y , that is: 

h(i,j) <- h(i,j) - z+ ■ (. h x (i,j ) + h y (i,j)) (3) 

h(i+ l,j) «- h(i + 1, j) +2+ • h x (i,j) (4) 

h(i,j + 1) <- h(i,j + 1 )+z + - h y (i,j) (5) 

where z+ is a parameter of the model which indicates the velocity of flowing 
matter, and h x , h y are the partial derivatives of h(x,y). 

From the assumption that the values of the automata are a discretization of 
the surface of the system, the value of V/i can be obtained taking the approxi- 
mation of the derivatives by differences 



h x {i,j) = y-j) 

h y(iJ) = 



h(xi + 1 ,yj) - h(xj,yj) 

Xi 

h(xj,y j+ 1 ) - h(xj,yj) 
Vj+i - Vj 



This rule is based in the fact that the gradient vector field —Vh(x, y) indicates 
the direction of maximum slope at every point in the surface of the heap. When 
a cell is updated, if the angle of the slope is higher than the repose angle, the 
matter flows following the direction of the strongest slope. 



2.1 Relation with the BCRE 

Once the update rule has been set, the relationship between the behavior of this 
CA model and the BCRE continuous model will be shown. In order to obtain 
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this relationship the variation of both models at a given point (a:,, yj ), and along 
a fixed time interval St, will be obtained. It will be shown that, under certain 
assumptions, the difference between the models tends to zero if the time interval 
and the cell size are reduced. 

The system evolution after time St is represented in the CA by an update of 
the grid using the specified rule. The variation of the height of a cell (i,j) will 
depend on the update of the cell (i,j) itself, and on the update of cells (i — 1 ,j) 
and ( i,j — 1), as they can drop grains into cell ( i,j ). In a situation in which 
these three updates are not null, the difference between the height of a cell of 
the automata, before and after the update, is 



Ah c = 2 + ( h x (i,j ) + h y (i,j) - h x (i - 1 , j) - h y {i,j - 1 )) 



( 6 ) 



To see the variation of the BCRE model after time St, the thickness of the 
rolling layer r will be considered constant in an open set A including a given 
point ( Xi,yj ). According to equation (2), in ( Xi,yj ) the variation of the total 
height of the system, h = r + s, is 

h t = r t + s t = wV(rVs) (7) 

that, as r is constant around ( Xi,yj ), equals 

= iu’V(Vs) = vrV(VA) = vr ( h xx + h vy ) . (8) 

Developing the Taylor series of h(xi,yj,t) as a function of time, one obtains 

Ah b = h(t + St) — h(t) = h t .St + R{St ) 2 (9) 

= St vr(h xx + hyy) + R(5t) 2 . 

In a similar way, Taylor series of h x (-,yj,t) considered as function of x and 
h y {xi, •, t) considered as function of y give 

i f \ h x (xi+\,yj) — h x (xi, yj) . . 

h'xx\%i->yj) — -^l(*^2+l 3Ci) 

X%- 1-1 Xi 

hyy (xi, yj ) = h v U " lljt + MVi+1 - V:) ■ 

Vj + 1 yj 

From Equation (9), assuming x^ — Xi - 1 = yj — yj-i = d 

'h x (x + d,y) — h x (x,y) 



h{t + St) — h(t) = St vr 



(10) 

( 11 ) 

(12) 



h y (x,y+d) - h y {x, y) 



+ (Ri + i?2)d 



+ R(St) 2 . 

Expressions (6) and (12) give respectively the variation of the proposed CA 
model, Ahc, and of the BCRE model, Ahs, after an interval of time St. Taking 

Stvr , , 

2+ = (13) 
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and subtracting equalities 6) and (12) one gets 

Ahc — Ahs = St vr{R\ + R, 2 )d + R(St ) 2 (14) 

that represents the difference of the evolution of both models at point (a ’i,yj). 

Therefore, considering the situation in which the thickness of the rolling layer 
in the BCRE model is constant around (a:*, yj ), and in which the three cells (i, j), 
— and (i,j — 1) are effectively updated (i.e. cell (i, j) is in a flowing region), 
both models converge locally if St and d tend to zero. 

Notice that the assumption of a constant rolling layer implies that the matter 
flow is not null in a certain region around point ( Xi,yj ). In the terms of the 
CA representation this means that, for d small enough, the update of the four 
neighbors of cell ( i,j ) is not null. This is, precisely, which is the assumption 
made over the CA model to get equation (14). 

Thus, as a conclusion, equation (14) expresses that in a point around which 
the variation of the rolling layer is neglectable, both models lead to the same 
behavior. 



3 Avalanches and 1/ f Noise 

Granular systems are considered a paradigmatic example of systems that evolve 
to a state called Self Organized Criticality (SOC) [11]. When in this regime, 
the system remains stable but near to unstable states. When it is perturbated, 
the system evolves through a set of unstable states until a new stable state is 
reached. This behavior is usually illustrated by a slope of sand into which grains 
are dropped randomly. When the slope is higher than a certain angle, the grains 
start rolling and an avalanche takes place until the system gets relaxed. 

Some numerical experiments have been prformecl in order to test if one of 
the most characteristic marks of this behavior, the so called 1 // noise, is also 
present in the model. 1// noise refers to the frequency spectrum of certain events 
such as avalanches: in a critical system the frequency at which a certain event 
happens is proportional to the inverse of its magnitude. 

Reproducing the tests performed by several authors [1,12,13,14], the fre- 
quency of the size of avalanches along a slope, and the correlation between the 
frequency and the size have been obtained. As a measure of size three values 
have been chosen; its length as the number of steps that lasts until the system 
stops again, the area of the avalanche or the number of cells it affects, and the 
mass displaced, that is the sum of all the matter transfer at every update. Figure 
1 shows the power spectra of the measured events. 

They have been approximated by a theoretical 1 / f~@ function and the corre- 
lation of the data have been calculated. The results of this tests are: for the length 
of the avalanches, /3 = —6.255718 with a correlation r = 0.862181, for the area, 
/3 = —3.375581 and r = 0.933205 and for the displaced mass, f3 = —7.518815 
and r = 0.960261. 

The correlation of the measurements of frequency to the theoretical function 
indicates that the well known behavior of granular systems as critical systems 
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Avalanche length Amount of matter displaced 





Affected area 




Fig. 1. Frequency spectra for the length, the area and the amount of matter affected 
by an avalanche, showing the existence of an *// _/3 relationship. The dashed line shows 
the regression of the data to a function of that family. 



is possible in the presented model. This fact, together with the results obtained 
in section 2.1 make the model a good tool for the analysis and representation of 
granular systems. 

4 Application of the Model 

The integration of the developed model in a more complex application has been 
done during the development of the Mobile Crane Simulator UVSim, that is 
currently used in Valencia Harbor. The model presented in this paper is used as 
a dynamic model for the behavior of bulk material within the complete dynamic 
model of the virtual environment in the simulator. 

In addition to the 3D graphics representation of the system, some interaction 
modes have been defined in order to allow the manipulation of the bulk mate- 
rial. Further details on the implementation and on the 3D representation of the 
system using the CA model can be found in [5] . 

5 Conclusions 



In this paper a model addressed to real-time simulation of granular systems that 
encloses properties of both continuous media models and CA models has been 







236 



M. Pla-Castells, I. Garcia, and R.J. Martinez 



defined. On the one hand, it has been shown that the evolution of the surface 
of the system is equivalent to the well known BCRE model. On the other hand, 
some experiments have been done in order to see that the model also keeps at 
least one of the most characteristic statistical properties of CA. 

The evolution of the model has been analyzed at an arbitrary point; it has 
been shown that the local behavior of the surface is the same that in the BCRE 
model under the assumption that the rolling layer of the system does not vary 
substantially near the analyzed point. It has been also stated a relationship 
between the parameters of both models. The amount of matter displaced z+ 
is in equation (13) and the threshold of the automata is set as arctana. This 
correspondence allows a direct parameterization of the new model taking as a 
starting point the parameters of the BCRE model. 

Hence, the model fulfills the main requirements stated in Section 1; it behaves 
as one of the most extended continuous models for granular systems, and has 
the advantage that the CA implementation is suitable for real-time applications. 

Also it has been shown that, although the model behaves locally as a con- 
tinuous media model, the statistical behavior of the model is consistent with 
previous CA models. The power spectra of three variables related to avalanches 
have been obtained, showing that 1 // noise, a paradigmatic characteristic of 
granular systems, can be present in the dynamics of the model. 

It can be concluded that the model that has been presented in this work cov- 
ers the main goals proposed in this paper, providing an improvement of previous 
CA models. 

As future research, the model should be completed to fit the BCRE model in 
the general case, and perhaps other continuous models should also be translated 
to a CA model. The analysis of the statistical properties of the model has to 
be completed in order to determine the different regimes of its dynamics. The 
possibility of pattern formation can also be studied simulating the experiments 
usually performed with real granular systems. These works will provide a more 
robust evidence of the validity of the CA model for granular systems simulation. 
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Abstract. MGS is an experimental programming language for the mod- 
eling and the simulation of discrete dynamical systems. The modeling 
approach is based on the explicit specification of the interaction struc- 
ture between the system parts. This interaction structure is adequately 
described by topological notions. The topological approach enables a 
unified view on several computational mechanisms initially inspired by 
biological or chemical processes (Gamma and cellular automata). The 
expressivity of the language is illustrated by the modeling of a diffusion 
limited aggregation process on a wide variety of spatial domain: from 
cayley graphs to arbitrary quasi-manifolds. 



1 Introduction 

In this paper, we are interested in the modeling of discrete dynamical systems 
with some internal structure that arises from their evolution function. 

Consider the following discrete process: a particle q moves randomly on a 
square lattice. So, if q is on node (x,y) at time t, then at time t + 1 it occupies 
one of the node {x + e x , y + e y ) where e x , e y are randomly chosen in {—1,0, 1} 
such that | e x | + \e y \ ^ 0. The building of a synchronous cellular automata (CA) 
that simulates this process is not immediate because the update rules change 
the state of only one node. It will be much more easy to allow rules that update 
synchronously a entire subset of the nodes. If this kind of rule is allowed, then it 
would be trivial to define this process as the simultaneous update of two neighbor 
cells, one empty and the other containing a particle, by the exchange of their 
state. 

The problem of updating a subset of cells defined by some arbitrary con- 
ditions is often solved using an asynchronous dynamics: this avoids that two 
occupied cells decide at the same step to occupy the same empty neighbor. An- 
other solution is the partitioning of the cells into a coarser graph (this is the 
approach of lattice gas automata [TM87]). 

We see here these two approaches mainly as tricks that do not account that 
a system can be decomposed into subsystems defined by the requirement that 
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the elements into the subsystems interact together and are truly independent 
from all other subsystems parallel evolution. 

In this view, the decomposition of a system S into subsystems Si, S 2 , ■ ■ - , S n 
is functional: the state Si(t + 1) of the subsystem S, depends solely of the 
previous state Si(t). However, the decomposition of S into the Si can de- 
pend on the time steps. So we write S{, S|, ■ ■ ■ , S' nt for the decomposition of 
the system S at time t and we have: Sj(t + 1) = /i-(sj(t)) where the h\ are 
the “local” evolution functions of the Sj. The “global” state s(t) of the sys- 
tem S can be recovered from the “local” states of the subsystems: it exists 
a function ip' such that s(t) = <p'{s\{t), . . . , s nt {t)) which induces a relation 
between the “global” evolution function h and the local evolution functions: 
s(f + 1) = h(s(t)) = p' {h\ (si (t) ),..., h' n± (s n± (t) ) ) . 

The successive decomposition S* , 5|, . . . , S' nt can be used to capture the el- 
ementary parts and the interaction structure between these elementary parts 
of S. Cf. Figure 1. Two subsystems S' and S" of S interact if it exists some 
Sj such that S', S'' € S'. Two subsystems S' and S" are separable if it exists 
some Sj such that S' £ S' and S'' ^ S' or vice-versa. This leads to consider the 
set S , called the interaction structure of S, defined by the smaller set closed by 
intersection that contains the S'. 



S( 0) 



5 











js? 



s(l) 



■ S '. 1 



S[ 



s(t) 



s' g y(5) 



Fig. 1 . The interaction structure of a system S resulting from the subsystems of ele- 
ments in interaction at a given time step. 



The set S has a topological structure: S corresponds to an abstract simplicial 
complex. An abstract simplicial complex [Mun84] is a collection S of finite non- 
empty set such that if A is an element of 5, so is every nonempty subset of 
A. The element A of S is called a simplex of S ; its dimension is one less that 
the number of its element. The dimension of S is the largest dimension of one 
of its simplices. Each nonempty subset of A is called a face and the vertex set 
V(S), defined by the union of the one point elements of S , corresponds to the 
elementary functional parts of the system S. The abstract simplicial complex 
notion generalizes the idea of graph: a simplex of dimension 1 is an edge that 
links two vertices, a simplex d of dimension 2 can be thought of as a surface 
whose boundaries are the simplices of dimension 1 included in d , etc. 

Our proposal is to specify a discrete dynamical system starting from its 
interaction structure. Our idea is to define directly the set S with its topological 
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structure and to specify the evolution function h by specifying the set S\ and the 
functions h\. The result is an experimental programming language called MGS. In 
MGS, the interaction structure S is defined as a new kind of data structures called 
topological collections and a set of functions h\ together with the specification 
of the S\ for a given t are called transformation. A transformation is a function 
defined by the cases S'. 

This abstract approach has two main advantages. The first one is that it 
enables the easy specification of dynamical systems with a dynamical structure 
(DS) 2 . In such systems, the phase space cannot be defined a priori and must be 
computed jointly with the state of the system (in other word, the set of states 
must be an observable of the system itself, see [Gia03]). The second advantage 
is that this abstract approach enables an homogeneous and uniform handling 
of several computational models including CA, lattice gas automata, abstract 
chemistry, Lindenmayer systems and several other abstract reduction systems. 

The rest of the paper is organized as follows. We begin with a short intro- 
duction to the MGS experimental programming language, through the notions of 
topological collection and transformation. Then, section 3 illustrates the appli- 
cation of these notions on a simple CA. The same diffusion-aggregation process 
is considered on two increasingly more complex topologies. Finally, we review 
some related works and the perspectives opened by this research. 

2 A Brief Presentation of the MGS Language 

MGS embeds a complete, impure, dynamically typed, strict, functional language. 
We focus on the notions required to understand the rest of the paper and we 
only describe here the major differences between the constructions available in 
MGS with respect to functional languages like OCAML [Ler96]. 



2.1 Topological Collections 

The distinctive feature of the MGS language is its handling of entities structured 
by abstract topologies using transformations [GM02] . A set of entities organized 
by an abstract topology is called a topological collection. Topological means here 
that each collection type defines a neighborhood relation inducing a notion of 
subcollection. A subcollection S' of a collection S' is a subset of connected ele- 
ments of S and inheriting its organization from S. 

Topological Collection and the Representation of a (DS) 2 State. A topological 
collection is used to represent the complex states s of a dynamical system S at 
a given time t. The elements of the topological collection are the elements S' 
of V(S) and each element has a value given by s'. In the context of topological 
collections, the elements of L(S) are called positions and we say that s' is the 
value associated to the position S'. Often there is no need to distinguish between 
the positions and their associated value. In this case, we use the term “element 
of the collection” . 
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Collection Types. Different predefined and user-defined collection types are avail- 
able in MGS, including sets, bags (or multisets), sequences, several grids and ar- 
bitrary topologies. We introduce the collection types along with the examples. 

For any collection type T, the corresponding empty collection is written ():T. 
The join of two collections C\ and C 2 (written by a comma: C\ ,Cf) is the main 
operation on collections. The comma operator is overloaded in MGS and can be 
used to build any collection by joining its two arguments. To spare the notations, 
the empty sequence can be omitted in the definition of a sequence: 1,2,3 is 
equivalent to 1,2,3, ():seq. 



2.2 Transformations 

Transformation are used to specify the evolution function of a (DS) 2 . The global 
transformation of a topological collection s consists in the parallel application of 
a set of local transformations. A local transformation is specified by a rule r that 
specifies the replacement of a subcollection by another one. The application of 
a rewriting rule cr => /(cr, ...) to a collection s: 

1. selects a subcollection Si of s whose elements match the pattern a, 

2. computes a new collection .s' as a function / of .s, and its neighbors, 

3. and specifies the insertion of .s' in place of Sj into s. 

One should pay attention to the fact that, due to the parallel application 
strategy of rules, all distinct instances Si of the subcollections matched by the a 
pattern are “simultaneously replaced ” by the /(s,;). 

The MGS experimental programming language implements the idea of trans- 
formations of topological collections into the framework of a functional language: 
collections are just new kind of values and transformations are functions acting 
on collections and defined by a specific syntax using rules. Transformations (like 
functions) are first-class values and can be passed as arguments or returned as 
the result of an application. 

Path Pattern. A pattern cr in the left hand side of a rule specifies a subcol- 
lection where an interaction occurs. A subcollection of interacting elements can 
have an arbitrary shape, making it very difficult to specify. Thus, it is more 
convenient (and not so restrictive) to enumerate sequentially the elements of the 
subcollection. Such enumeration will be called a path. 

A path pattern Pat is a sequence or a repetition Rep of basic filters. A basic 
filter BF matches one element. The following (fragment of the) grammar of path 
patterns reflects this decomposition: 

Pat ::= Rep \ Rep , Pat Rep ::= BF \ BF / exp BF ::= cte | id | <undef > 

where cte is a literal value, id ranges over the pattern variables and exp is a 
boolean expression. The following explanations give a systematic interpretation 
for these patterns: 
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literal: a literal value cte matches an element with the same value. For example, 
123 matches an element with value 123. 

empty element the symbol <undef> matches an element with an undefined 
value, that is, an element whose position does not have an associated value, 
variable: a pattern variable a matches exactly one element with a well defined 
value. The variable a can then occur elsewhere in the rest of the rule and 
denotes the value of the matched element, 
neighbor: b, p is a pattern that matches a path which begins by an element 
matched by b and continues by a path matched by p, the first element of p 
being a neighbor of b. 

guard: p/exp matches a path matched by p if boolean expression exp evaluates 

to true. 

Elements matched by basic filters in a rule are distinct. So a matched path is 
without self-intersection. 

Right Hand Side of a Rule. The right hand side of a rule specifies a collection 
that replaces the subcollection matched by the pattern in the left hand side. 
There is an alternative point of view: because the pattern defines a sequence of 
elements, the right hand side may be an expression that evaluates to a sequence of 
elements. Then, the substitution is done element-wise: element i in the matched 
path is replaced by the zth element in the r.h.s. This point of view enables a 
very concise writing of the rules. 



2.3 Short Example 

We give an example that imply the transformation of a sequence of elements. 

Bubble Sort, in MGS. The bubble sort consists in (1) comparing two neighbors 
elements in a sequence and swapping them if they are not in order; (2) repeating 
the first step until a fixed point is reached. This specification is immediate in 
MGS and can be written: 

trans bubble_sort ={ x,y/(x>y)=>y,x } 

The keyword trans introduces the definition of a transformation by a 
set of rules. Here there is only one rule. This transformation can be 
applied to a sequence s := 4, 2, 3, 1 until a fixed point is reached: 
bubble_sort [ ’ iter=’f ixpoint] (s). The value of the predefined optional pa- 
rameter ’ iter indicates that the application of the function bubble_sort must 
be iterated until a fixed point is reached. The results is 1 , 2 , 3 , 4 as expected. 

3 (DS ) 2 on Complex Topologies 

In this section we present the use of MGS to model and simulate more complex 
systems. Our running example will be a diffusion limited aggregation process on 
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different spatial domains. Diffusion Limited Aggregation, or DLA, is a fractal 
growth model studied by two physicists, T.A. Witten and L.M. Sander, in the 
80’s. The principle of the model is simple: a set of particles diffuse randomly on 
a given spatial domain. Initially one particle, the seed, is fixed. When a mobile 
particle collides a fixed one, they stick together and stay fixed. For the sake 
of simplicity, we suppose that they stick together forever and that there is no 
aggregate formation between two mobile particles. 

This process leads to a simple CA with an asynchronous update function or 
a lattice gas automata with a slightly more elaborate rule set. The purpose of 
this section is twofold. Firstly, we want to show that the MGS approach enables 
the specification of a simple generic transformation that can act on arbitrary 
complex topologies. Secondly, we show how to specify MGS topological collections 
that correspond to standard CAs. 

3.1 The DLA Evolution Function in MGS 

The transformation describing the DLA behavior is really simple. We use two 
symbolic values ‘mobile and ‘fixed to represent respectively a mobile and a 
fixed particle. There are two rules in the transformation: (1) if a diffusing particle 
is the neighbor of a fixed seed, then it becomes fixed; (2) if a mobile particle is 
neighbor of an empty place, then it may leave its current position to occupy the 
empty neighbor. Note that the order of the rules is important because the first 
has priority over the second one. Thus, we have : 

trans dla = { 

‘mobile, ‘fixed => ‘fixed, ‘fixed 
‘mobile, <undef> => <undef>, ‘mobile 

} 



3.2 DLA on Uniform Topologies: GBF 

Group-based data fields (GBF in short) are topological collections used to define 
topologies with a uniform neighborhood: a position cannot be distinguished only 
by looking at its neighbors. This implies for example that each position has the 
same number of neighbors. 

A GBF is an extension of the notion of array where the elements are indexed 
by the elements of a group called the shape of the GBF [GMS95,GM01]. For 
example: 

gbf Grid2 = < north, east > 

defines a GBF collection type called Grid2, corresponding to the Von Neumann 
neighborhood in a classical array (a cell above, below, left or right - not diagonal) . 
The two names north and east refer to the directions that can be followed 
to reach the neighbors of an element. These directions are the generators of 
the underlying group structure. The r.lr.s. of the GBF definition gives a finite 
presentation of the group structure [Slra90]. The list of the generators can be 
completed by giving equations that constraint the displacements in the shape: 
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gbf Torus2 = < north, east; 12*east = 0, 40*north = 0 > 

defines a 40 x 12 torus: if one starts from a position and follows 12 times the 
east direction, one finds oneself at the same position. Another example is the 
definition of an hexagonal lattice that tiles the plane: 

gbf Hexa2 = <east, north, northeast; east + north = northeast > 

Each cell has six neighbors (following the three generators and their inverses). 
The equation east + north = northeast specifies that a northeast move is 
equivalent to a move following the east direction followed by a move follow- 
ing the north direction. 

A GBF value of type T is a partial function that associates a value to some 
group elements (the group elements are the positions of the collection and the 
the empty GBF is the everywhere undefined function). The topology of T is 
easily visualized as the Cayley graph of the presentation of T: each vertex in the 
Cayley graph is an element of the group and vertices x and y are linked by an 
edge labeled g if there is a generator g in the presentation such that x + g = y. 

Figure 2 represents the final state of a DLA process, where each initially 
mobile particle has been fixed. The plot at the left is the application, until a 
fixed point has been reached, of the transformation dla to a Torus2. The plot 
at the right is the fixed point reached by the same transformation dla to a 
Hexa2. In this last simulation, particles are initially packed on the right and the 
initial static particle was on the left part of the lattice. Particles are constrained 
to move on a rhombus. This explains the asymmetries of the figure. 




Fig. 2. Example of DLA on two different topologies: a torus (left) and an hexagonal 
meshes (right) 



3.3 Arbitrary Topology: DLA on Cellular Complexes 

Beyond Graphs. The interaction structure of the previous examples can be ade- 
quately described by a graph: two positions are connected by an edge if they are 
neighbors. Sequences correspond to linear graphs and GBFs provide Cayley’s 
graphs with regular connectivity. This last family of topologies is convenient to 
represent regular spaces where all elementary parts have the same spatial prop- 
erties, but they cannot be used for instance to model a sphere (there is no regular 
graph on a sphere that corresponds to the Cayley graph of some group) . 
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This shortcoming motivates the development of more powerful (arbitrary) 
topologies. First we would like to represent irregular meshes. However, we also 
want to represent heterogeneous spaces. As an example, let consider the electro- 
static laws. They depend on the geometry of the system and some values must 
be associated to a dimension: the distribution of electric charges corresponds 
to a volumic density while the electric flux through a surface is associated to a 
surface. Note that balance equations often link these values, such as the Gauss 
theorem for our electrostatic example. See also the work of E. Tonti [Ton74] for 
an elaboration. 

As a consequence, the interaction structure that describes a system S may 
contains simplices with dimension greater than one. In general, any component 
of any dimension and their interactions should appear in the description of the 
system, and we should be allowed to associate some values to them. 

Arbitrary Topological Collection as Cellular Complex. The right framework to 
develop a topological collection type that allows the representation of arbitrary 
topologies is the combinatorial algebraic topology theory. 

In this framework, a topological collection is a cellular complex : a collection 
of objects of various dimension called k-cell , where k is the dimension. To be 
more practical, 0-cells are vertices, 1-cells are edges, 2-cells are faces, 3-cells are 
volumes, etc. To build some arbitrary complex domain, the domain is divided 
into a cellular partition. Each cell represents a simple part of the domain and 
the cells are glued together: a fc-cell c\ is incident to a (k — l)-cell C 2 if C 2 C dci, 
where dc\ denotes the border of C\. This boundary relation d can be used to 
specify the neighborhood relationships in a topological collection: two k-cells 
are neighbors if they share an incident ( k — 1 )-cell or if they are incident to a 
same ( k + 1 )-cell. This definition of a topological collection is consistent with 
the previous one. 

G-map in MGS. There are several specializations of the notion of cellular complex. 
For instance abstract simplical complex evoked in the introduction are special 
cases of cellular complexes. In MGS one can use generalized map or (Gmap) [Lie91] 
to build arbitrary topologies. The topological objects that can be described by 
Gmaps are quasi-manifolds. 

There are several ways to specify and build Gmaps in MGS. A Gmap can be 
build as the result of construction operations like various products, extrusion, 
suspension, pasting, gluing, etc. A perhaps simpler way is to edit manually the 
Gmap in an interactive CAD tool like MOKA 1 and to import the result in MGS. 

The figure 3 shows applications of the DLA transformation on different kinds 
of objects built with Gmaps. As a matter of fact, the change of topological 
collection doesn’t affect the transformation and we still apply the same dla 
transformation. In these examples, the topological collections have dimension 2 
and the values are associated only to 2-cells. The 2-cells are neighbors if they 
have a common edge on their boundary. In the top of the figure, only the 1-cells 
are figured, and in the bottom, only the 2-cells that hold a value are represented. 

http : / /www . sic . sp2mi .univ-poitiers . fr/moka/ 



1 
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Fig. 3. DLA on complex objects (topology and final state). On the left: a sphere with 
18 parallels and 24 meridians. On the right: a Klein’s bottle. 



4 Conclusion and Perspectives 

This paper only focuses on a part of the features available in MGS that can be 
used to develop computer models of complex discrete dynamical systems that 
are concise and mathematically well-founded. The approach has been applied 
successfully to several biological processes (the growth of a tumor, the flocking 
of birds, colonies of ants foraging for food, the heterocysts differentiation during 
Anabaena growth, etc.) as well as more algorithmic problems (flow on graphs, 
various sorting algorithms, Hamiltonian path, prime number generation, etc.). 

The modeling of (DS) 2 through their interaction structure is part of a long 
term research effort [Gia03]. The topological approach presented here provides 
a unified description of several computational models. Obviously, Lindenmayer 
systems [Lin68] correspond to transformations on sequences. Chemical compu- 
tation, as in Gamma [BFM01], can be emulated using transformations on bags. 

There exists strong links between GBF and cellular automata, especially 
considering the work of Z. Roka [Rok94] which has studied CA on Cayley graphs. 
However, our own works focus on the construction of Cayley graphs as the 
shape of a data structure and we develop an operator algebra and rewriting 
notions on this new data type. This is not in the line of Z. Roka who focuses on 
synchronization problems and establishes complexity results in the framework 
of CA. 

The perspectives opened by this work are numerous. From the applications 
point of view, we are targeted by the simulation of developmental processes 
in biology [GM03,GMM04]. At the language level, the study of the topological 
collections concepts must continue with a finer study of transformation kinds. 
Several kinds of restrictions can be put on the transformations, leading to various 
kind of pattern languages and rules. The complexity of matching such patterns 
has to be investigated. The efficient compilation of a MGS program is a long-term 
research. We have considered in this paper only one-dimensional paths, but a 
general n-dimensional notion of path exists and can be used to generalize the 
substitution mechanisms of MGS. 

The sources of the current MGS implementation, as well as several examples 
and technical reports, are freely available at http://mgs.lami.univ-evry.fr. 
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Abstract. Quantum Systems modeling and simulation activity is a challenging 
research area. Several models and applications were proposed in the past with 
different approaches. Special purposes models based on Cellular Automata 
(CA) paradigm were studied as the classical CA present limits in quantum 
system characterization. In this paper we derive a basic qualitative model, 
according to the Cellular Automata Network Model version 2 (CAN2), for 
studying and characterizing a frustrated liner JJA, showing that, when adopting 
particular system conditions, the corresponding model is extremely simple, and 
each component characterizing system can be easily individuated. 



1 Introduction 

Quantum Systems modeling and simulation activity is a challenging research area. 
Several models and applications were proposed in the past with different approaches. 
Special purposes models based on Cellular Automata (CA) paradigm were studied 
[13] as the classical CA present limits in quantum system characterization [18]. An 
example of special CA quantum models is the Quantum cellular Automata, where 
each cell is a quantum dot system [13]. 

In this paper we deal with a particular mesoscopic quantum system: a frustrated 
linear Josephson Junction Array [9,16]. These JJAs are very challenging solid-state 
devices: when they are arranged according to a non-trivial geometry at low 
temperatures, they develop topology order [19,20], allowing for obtaining a qubit 
[7,8] protected from decoherence [6,10,12]. The JJA devices behavior is often 
reproduced, according to the underlying physical system, adopting Monte Carlo 
simulation and exact diagonalization methods. Here we describe our qualitative 
model, for qubit above, trying to capture the main peculiar aspects for a CAN2 [4] 
based model reproduction. 

This paper is the first but mandatory stage towards the definition of the basic 
computational building block for the simulation of an ideal quantum computer. The 
choice of such a system characterization embodies an inner parallelism behavior. In 
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fact the adopted CA computational model that embodies an inner data parallelism is 
the inner parallelism level of a more future complex architecture built with an outer 
quantum parallelism degree. 

The challenging future result, which has an intrinsic difficulty, is to combine and 
synchronize the parallelism levels above. 

The paper is organized as follows: Section 2 deals with the phenomenological 
description of the devices above; Section 3 introduces the Cellular Automata Network 
v. 2 model that takes into account the classical Cellular Automata (CA) computational 
model and its modification in order to be used for macroscopic complex phenomena 
modeling activity; Section 4 deals with the description of a JJA model, obeying to 
particular conditions, according to a CAN2 computational model; finally some 
remarks and future works descriptions conclude this paper. 



2 Josephson Junction Arrays (JJA): The Physical Model 

Josephson junction arrays (JJA) are a very useful tool for investigating quantum- 
mechanical behavior in a wide range of parameter space. In fact there exists a couple 
of conjugate quantum variables, the charge and phase ones on each superconducting 
island, which results in two dual descriptions of the array [2]: 

1 . through the charges (Cooper pairs) hopping between the islands, 

2. through the vortices hopping between the plaquettes. 

Furthermore, in the presence of an external magnetic field charges gain additional 
Aharonov-Bohm phases and, conversely, vortices moving around islands gain phases 
proportional to the average charges on the islands [1]. Such basic quantum 
interference effects found application in recent proposals for solid state qubits for 
quantum computing, based on charge [17] or phase [3,11,14] degree of freedom in 
JJAs. 

“Charge” devices operate in the regime 

E c ))Ej 



where: 

e 2 ti 

• E c = is the charging energy and Ej = — I c is the Josephson coupling 

2 C 2e 

energy; 

• C is the capacitance of each island, and 

• Ic the critical current of each junction; 

while “phase” or “flux” devices are characterized by strongly coupled junctions with 

Ej ))E C . 

Recently new kinds of JJAs have been proposed [6,12], with non-trivial geometry, 
which allow for a novel state at low temperature characterized by a discrete 
topological order parameter [19,20]. In fact, the Hilbert space of their quantum states 
decomposes into mutually orthogonal sectors (“topological” sectors), each sector 
remaining isolated under the action of local perturbations and protected from leakage 
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through a gapped excitation spectrum. If we choose the two qubit states from ground 
states in different sectors we get a 2 K -dimensional subspace (K is the number of big 
openings in the array, in general equal to the effective number of qubits) which is 
“ protected ” from the external noise [10]. Once the protected qubit has been 
implemented, global operators must be found in order to manipulate its states. Further 
steps are the implementation of non-trivial two-qubit operations and the construction 
of a register of qubits. It is evident that such arrays could be promising candidates for 
a physical implementation of an ideal quantum computer [7,8]. 

After such a brief account of general features of "protected” IJAs qubits, we focus 
on the simplest physical array one can devise to meet all the above requests, that is a 



1 , 1 

ring array of N Josephson junctions with half flux quantum ( — <P 0 = — 



f he' 
\2e/ 



threading each plaquette [9,16]. The number of plaquettes must be correspond to the 
number of junctions, so that each plaquette contains two junctions. 



4- 


0- junction 


•%[ — i — i — i — i — i — i — i — ° 


71- junction 


\ H H H {> i} 


boundary 

condition 





Fig. 1 . JJA with N=10 junctions (0 and n) and N=10 loops in a ring geometry. 

Such a condition is fulfilled when each plaquette of the ring contains an odd 
number of so called n- junctions (a n- junction is characterized by a it shift in the 
phase of the wave function on one side of the junction, so that the current-phase 
relation is I=Ic sin((p+7t)), one in our case. 

The array above, sketched in Fig. 1 for N=10, has a ground state twofold 
degenerate with antiferromagnetic ordering as a result of currents circulating 
clockwise and counterclockwise in odd and even plaquettes respectively, hence it can 
be mapped into a linear antiferromagnetic chain of half-integer spins. 



3 CAN and CAN2: Two CA Based Computational Models for 
Micro and Macro Complex Systems Simulation 

CA, in the classical model [21,22] based on a regular spatial grid of cells, each cell 
embedding identical Finite Automata (FA), whose input is the states of neighboring 
cells; FA have an identical transition function applied simultaneously to each cell. 
The neighboring cell is defined in terms of a spatial pattern, invariant in time and 
space. At the time t=0, FA are in an arbitrary state and the CA evolve changing the 
state of all the FA simultaneously at discrete time steps, according to the FA transition 
function. 
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The following informal properties characterize a CA: 

• a regular discrete lattice of cells, with a discrete variable at each cell assuming a 
finite set of states, 

• an evolution law, called transition function, that takes place in discrete time steps, 

• each cell evolves according to the same rule depending only on the state of the cell 
and a finite number of neighboring cells; the neighborhood relation is local and 
uniform. 

In implementing CA applications, due to the finite storage and finite computational 
power of existing machines, many choices about the components of a CA have to be 
made [21]. Here we give a brief comment on possible choices regarding lattice 
geometry and neighborhood size: 

• lattice geometry i.e. selection of a specific lattice geometry can be one-, two- or 
three-dimensional; 

• neighborhood size i.e, the choice of the neighborhood type for each cell, for 
instance for a two-dimensional lattice patterns as Von Neumann and Moore types 
are generally used. 

CA proved to be an effective computational model in simulating micro and 
macroscopic phenomena, although CA are inspired by a-centrism [15] the modeling 
activity of real macroscopic phenomena requires some extension of classic CA model 
considering at first the following issues: 

1 . Dimension of the corresponding cells must be fixed in order to map the elementary 
portion of the considered region area to each cell grid. 

2. Clock unit must be fixed corresponding to the time step considered for each 
transition function iteration 

Our CA model modification, i.e, the CAN [5] model, extends the standard CA 
model introducing the possibility to have a network of cellular automata, where each 
automaton represents a component of a physical system and connections among 
network automata represent a disjoinable evolutive law that characterizes the system 
to be simulated. 

The CAN model can be applied when the construction of complex phenomenon 
models can be obtained by means of a reduction process in which the main model 
components are identified through an abstraction mechanism together with the 
interactions among components. According to the CAN model it is possible to 
simulate a two-level evolutionary process in which the local cellular interaction rules 
evolve together with cellular automata connections. 

The first version of the model allowed to simulate phenomena composed 
exclusively by CA component; a novel version CAN2 [4] extends the CAN model 
allowing for the introduction of global operators as possible nodes of a CAN network. 

As in CAN model, it is possible to simulate a two-level evolutionary process 
according to two different transition step types: a micro transition, i.e, the cellular 
automaton component step; and a macro transition, i.e. the whole time step network 
evolution. 

The presence of global operators is necessary when an external influence, to the 
macroscopic complex phenomena, assumes a control role and allows for expressing 
any mechanism that could not be expressed in terms of local interaction. Furthermore 
these mechanisms could not influence the evolution of the phenomenon. 

In some cases the introduction of global operators could be useful. Such cases 
includes: the derivation of statistics on the system, i.e. average on global 
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characteristics, or approximation accuracy, deriving approximation between the 
modelled phenomenon and the real one, or global controls. 

The presence of global operators, introducing another different computational unit, 
allows to define a renewed CAN model [5] i.e. CAN2 [4]. The original version limits 
each component to be a a cellular automaton. 

According to the network abstraction, a precedence relation must be introduced 
between each component of the model if some components use variables owned by 
other ones. More explicitly: a precedence relation, between a global operator and a 
cellular automaton, occurs if a global operator accesses a state/s values of the cellular 
automaton; or in an opposite direction a cellular automaton accesses a global variable 
values, defined in a global operator. A correspondent relation can be derived between 
cellular automata themselves and global operators themselves, when they access 
respectively states and global variables. 

A network of cellular automata can be represented as a graph, the CAN precedence 
graph. Figure 2, where: 

• nodes represent cellular automata or global operators, and 

• edges represent precedence relations between nodes. 

Let us now explain with an example the concepts above. 

For instance, let N be a network composed by three nodes N p N 2 and N,; where 
respectively N,, N, are cellular automata nodes and N, is a global operator. 

Let us suppose that: N 2 defines state S,, N 2 defines state S, and N, defines global 
variable g. If at each macro step the value S 3 is accessed by N 2 and S 2 is accessed by 
N 3 the network abstraction, by means of a graph and its precedence relation whose 
verse is depicted by the dark arrow, is shown in the following figure 




Fig. 2. A CAN2 Network abstraction example 



4 A Basic CA Based Model for Frustrated Linear JJA 

In order to model our physical system some considerations and initialization must be 
done. In particular different computational approaches can be combined to describe 
the entire system. 

In order to prepare the initial CA variable and parameter input values is necessary 
to approach the problem according to PDE tools. For our model we choose, as an 
example, an array with N=10 plaquettes/junctions; a minimum of two elements must 
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be used in order to capture all the interesting physics of such a system [9,16], but we 
need a large N in order to better and better fulfill the request of a topological 
protection against local fluctuations [7,8]. 

Subsection 4.1 deals with this issue, the following Subsection introduce the CA 
basic model. 



4.1 System Model Characterization 

In order to obtain a high level system description some abstractions on the physical 

system must be done: 

• lattice grid: is a linear one with N=10 cells 

• the cell dimension must be well defined in order to achieve a good system 
discretization. The total length of the array L (i.e. the sum of N=10 cells) is to be 
defined such that the following condition is true: L )) X, (where X, is the 
Josephson penetration length) [16], in order to meet “long” total junction 
requirement; 

• each cell is identified with a Josephson device node of two junctions of 
different type, regarding the cell center, identifying a right and a left side; 

• boundary condition: the topology is annular 

• each cell can be characterized by: 

• cell labeling, to identify each array cell; 

• current (we assume to stay in a stable state); 

• spin features, i.e. pseudospin assuming values (-1 o 1); 

• capacitance (equal for each cell); 

• phase_difference, phase shift, assuming different role according to cell side 
consideration, left (1) or right side (r), it will result phase_difference_rl, 
phase_difference_ll, phase_difference_r2, phase_difference_12 (where 1 and 2 
are the two possible ground states); 

• a frustration single sawtooth magnetic pulse B. 

At the staring point the phases differences are initialized as follows: 

The corresponding cell is related to the junctions as follows: 

1 . odd cells presents 0 junctions to its left and n type at its right 

2. even cells presents n type junctions to its left and 0 at its right 

At the system definition stage the mapping between cells and initial phase 

differences can be assigned. In order to explain immediately this mapping we express 

the relation and correspondence by means of the following table: 



Table 1. Mapping between cells and phase differences 



phase_difference 


even cell 


odd cell 


rl 


0 


+ 7t 


11 


0 


- K 


r2 


+ 7t 


0 


12 


- n 


0 
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The phase_differences, in Table 1, obtained after a PDE resolution [16] that 
involves them, are input values for the corresponding CA model and remain fixed 
thorough its whole evolution. 

The pseudospin state is related to the phase difference behavior: in correspondence 
to well defined phase difference scheme pseudospinup and pseudospindown are 
expressed as in the following two tables, table 2 and table 3, for even and odd cells 
respectively: 



Table 2. Mapping for even cells 





phase_diff 


pseudospinup 


pseudospindown 


rl and 11 


0 and + n 




X 


r2 and 12 


0 and - 71 


X 





Table 3. Mapping for odd cells 





phase_diff. 


pseudospinup 


pseudospindown 


rl and 11 


+ Ji and 0 


X 




r2 and 12 


- 7l and 0 




X 



This is the initial pseudospin configuration, according to the phase differences 
above. 



4.2 A Basic CA Definition for the System Evolution 

According to the consideration in the previous subsection, now we give a formal 
definition in terms of a CAN2 representation for the basic linear JJA system, under 
study, as follows: 

<R,X,S,G,P,Pvar,f,g> 

• R = {x| x eN} is the set of integer points in the finite region, the array, when the 
system evolves; each point identifies a cell. The lattice grid: is a linear array of 
with N=10 cells. 

• X, is the neighborhood set, when it occurs the set includes {x — l,x +1} for each 
cell x. 

• S=S,xS 2 xS,xS 4 is the set of state values; it is specified by the cartesian product of 
the finite sets of values of the three following substates respectively : 

• Pseudo_S pseudospin assuming (-1,1) values, 

• Mp magnetic pulse for each cell, fixed and invariant for each macro time step. 

• LABEL cell label to identify the cell, corresponding to a monotonic 
enumeration for all cells, itself invariant for each time step. 

• FLIP flip state to register if pseudopspin flipping taken. 

• G is the set of global variables B tot is the total applied magnetic pulse. 

• P is the finite sets of global constant parameters: 

• I current inside the cell 

• C capacitance 






A Basic Qualitative CA Based Model 255 



• Pvar is the finite set of CA component variables: 

• STEP is the step iterator useful to trigger the evolution. 

• f :S — > S is the deterministic state transition for determination of the pseudospin 
state and values. 

• g: S 2 — > G expresses the global operator which controls the total magnetic pulse 
applied on the system. 

In view of the implementation of a system protected form decoherence (see Section 
2) boundary condition topology is annular. 

In order to have a transition between the two ground states the magnetic pulse 
period must be related to the CA time step; we have two possibilities: 

1. the CA time step is equal to the pulse period, in order to capture the maximum 
pulse value; 

2. the CA time step can be arbitrary, but a pulse value threshold must be chosen. 

The first solution is more simple, we adopt this, with a modification. 

The flipping procedure between the input and output states implements a tunneling 
between two ground states which correspond to the physical processes of creation and 
annihilation of fluxon-antifluxon pairs. In general for N cells, (N\2) double flips are 
needed to switch from |0> to |1> . In the quantum regime such tunneling processes 




Function Transition CA 
Begin 

STEP=STEP+1 

If Mp ( i ) >0 

IF (FLIP =0 and 
(LABEL mod 4 ) = 0 or 

(LABEL mod 4) =1) ) 

IF (Pseudo_S= -1 and 

THEN Pseudo_S = 1 

ELSE 

Pseudo_s = -1 

EndlF 

ELSE 

IF (FLIP =0 and 

(FLIP ( i — 1 ) = 1 or 
FLIP ( i+1 ) =1) 

IF (Pseudo_S= -1 and 
THEN Pseudo_S = 1 

ELSE 

Pseudo_s = 1 

EndlF 

CALL Function Transition CA 
until STEP not = 2 

EndlF 

EndlF 



Fig. 3. The JJA system components according to a CAN2 vision. 
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display (N\2)! paths along they can occur. For this reason we choose to use a double 
step for the CA transition component, with each time step equal to the half of the 
single sawtooth magnetic pulse period. 

The CA component has, as initial condition, the pseudospin configuration obtained 
in the precedent stage (cfr. Subsection 4. 1 ) since the pseudospin configuration must 
obey to an antiferromagnetic arrangement and flipping state is zero. 

At the initial time, our device is in a steady state, in one of the two possible ground 
states. 

Each parameter is fixed, also the LABEL values are fixed for all transition steps, 
but the variable STEP is initialized to each macro step T. 

The transition function takes into account coupling factor, adding an external 
frustration, as a single sawtooth magnetic pulse acting on each lattice cell. 

The system transition is assumed to be composed by a global operator application 
followed by a transition function: 

F : SxGxPxPvar — » SxGxPvar that is the composition of the global operator and 
transition function of the cellular automata component twice applied. 

Let us now describe the system transition, expressed in pseudocode, for the global 
operator and the transition function as sketched in the boxes of the figure 3. 

The net result is the NOT-operator. 



5 Conclusions and Future Work 

In this paper we describe the first step in building up a model, using the CAN2 
computational model, for an elementary “protected” solid state qubit. The adopted 
boundary conditions allow for obtaining a well defined elementary device: two stable 
degenerate ground states and the transition between the two ground states. The net 
result is the NOT operation problem of controlling the amplitude and the relative 
phase of the superposition between the two ground states will be addressed in a future 
work. This is the basic building block for the implementation of an ideal quantum 
computer. For the future we plan to study and simulate the aggregation mechanisms 
between such basic blocks via CAN2 approach (basic quantum logical gates). 
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Abstract. This paper reports an efficient compression technology for 
voice data. It is developed around Programmable Cellular Automata 
(PCA) structure. The program executed on PCA is so designed that it 
ensures high speed transmission of voice data with desired level of secu- 
rity (encryption). Further, the regular and local neighborhood structure 
of CA is employed to design high speed low cost implementation of the 
encompression (encryption + compression) scheme. Comparison with 
standard GSM (Global System Mobile) technology, in respect of com- 
pression ratio, the quality of D’encompressed voice data and the quality 
of security attained, confirms high potential of the proposed scheme. 



1 Introduction 

The user community of this cyber age are concerned due to hacker’s intrusion 
into their private messages transmitted as voice data over the public network. 
Further, the growth of channel bandwidth will always lag behind the demand for 
transmission of larger volume of voice data from the user community. Efficient 
and secured voice data transmission demands due attention on following areas: 

— an efficient compression technique to reduce the data rate at source, 

— an encryption scheme to ensure secured transmission, and 

— low cost high speed execution of both the operations. 

In this context, we propose an integrated technology referred to as ENCOM- 
PRESSION supporting both ENcryption and COMPRESSION. The objective 
of this new technology is to ensure high speed secured transmission of voice data 
with low cost hardware. The major contributions of this research work are: 

* This work is supported by AICTE: File No: 8020/RID/TAPTEC-46/2001-2002 



P.M.A. Sloot, B. Chopard, and A.G. Hoekstra (Eds.): ACRI 2004, LNCS 3305, pp. 258—267, 2004. 
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Fig. 1. Encoder and Decoder 
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Fig. 2. Basic principle of Encompression 



— design of a compression technology for voice data with additional support of 
security referred to as encompression, 

— achieving high speed encompression through hardware implementation, 

— design of a VLSI architecture that executes a program on a regulars structure 
Programmable CA (PCA), to realize the encompression operation. 

The compression ratio and quality of encompressed/d’encompressed data are 
comparable to that of GSM [1] , while the security of encompression is comparable 
to Advanced Encryption Scheme (AES) [2]. An overview of the scheme follows. 

2 Overview of Voice Data Encompression 

A voice data file, to be compressed, is input in bit serial mode (Fig. 2). It is 
then transformed by DCT (Discrete Cosine Transform) [3]. The transform co- 
efficients are encoded with Cellular Automata (CA) based lossy compression 
technique (Stage 1). The compressed data enters in streaming mode to Stage 2 
and a block (of size same as that of encryption key ’k’) gets encrypted with CA 
based Cryptosystem (CAC). In order to recover the original data, at the receiver 
end, the encompressed data is first decrypted at Stage 3 and then decoded in 
Stage 4. The original voice data file is reconstracted through inverse DCT. 
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Encompressed File 



Fig. 3. Architecture of Encompression hardware 

2.1 Compression 

In the current implementation, the voice data is transformed into frequency do- 
main by DCT. The (lossy) compression of DCT co-efficients is obtained through 
Vector Quantization (VQ) [4] model. The major components of VQ are - 

— An encoder to encode each block of input file with index of a codevector, 

— A decoder that retrieves back the representative block from the codebook. 

The encoder, as shown in Fig. 1, takes an input vector and outputs the index 
of the corresponding codevector. The index of the codevector is then sent to 
the receiver. The decoder, on receiving this index file, replaces each entry with 
the associated codevector found from the codebook kept in the receiver site. 
However, the VQ scheme has the inherent drawbacks- difficulty in the generation 
of codebook to ensure high compression ratio, the high processing overhead at 
encoding stage. 

The proposed CA based design is targeted to overcome the above limitation 
of VQ. In the current design, rather than developing general compression scheme 
for any arbitrary class of data files, we concentrate on developing the compression 
scheme for a specific class of data - that is, voice data. It reduces the problem of 
designing efficient codebook. We design CA based hierarchical clustering scheme 
to model the compression operation at each node of VQ binary tree. A set of CA 
organized as a binary tree is generated. The tree is equivalent to the traditional 
codebook of VQ in the sense that the centroid of a leaf node cluster represents 
a code vector. 

2.2 Encryption 

The encryption process employs a series of reversible CA transforms on the com- 
pressed data to arrive at the encompressed data file. Four levels of CA transforms 
are employed to achieve the desired level of security [5,6]. The operations of CA 
based voice encompression are realized with a PCA program (Section 5). The 
architecture is shown in Fig. 3. The CA theory relevant for the work follows. 
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Fig. 4. State transition diagram of a 5-cell MACA < 102, 60, 204, 204, 170 >. 



3 Cellular Automata (CA) 

In a 3-neighborhood dependency, the next state <?,; ( t + 1) of a CA cell ’i ’ depends 
on itself and on its left and right neighbors, and is denoted as 

qi(t+ 1) = fi(qi-i(t),qi(t),q i+1 (t)). (1) 

cii(t) represents the state of the i th cell at t th instant of time, ‘/i’ is the next 
state function, called rule IZi. There are 2 2 i.e., 256 possible rules. Out of 256 
rules, 14 employ XOR/XNOR logic. The CA employing only such rules is called 
linear/additive CA. In CA based Cryptosystem (CAC), we use reversible lin- 
ear/additive, and non-linear CA. However, for CA based model of VQ we employ 
a non-reversible linear CA referred to as MACA. 



3.1 Multiple Attractor CA (MACA) 

An MACA consists of a number of cyclic and non-cyclic states. The set of non- 
cyclic states forms trees rooted at the cyclic states. A cyclic state with self loop 
is referred to as attractor. Fig. 4 depicts the state transition diagram of a 5- 
cell MACA with four attractors {00000(0), 00011(3), 00100(4), 00111(7)}. The 
states of a tree rooted at the cyclic state a forms the a-basin. A few fundamental 
results of MACA are next noted from [7,8] . 

Defnn: An m-bit field of an n-bit pattern set is said to be pseudo-exlraustive if 
all possible 2 m patterns appear in the set. 



Theorem 1. In an n-cell MACA with k = 2 m attractors, there exists m-bit 
positions at which the attractors generate pseudo-exhaustive 2 m patterns. 



Theorem 2. The modulo-2 sum of two states is a predecessor of 0-state (pat- 
tern with all 0) if and only if the two states lie in the same attractor basin. 
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Fig. 5. MACA based pattern clustering strategy 



Example. Consider an attractor {00011} and the states {01010, 01011} of the 
attractor basin. The module-2 sum of the two patterns is {00001} which is 
a state of 0-basin (Fig. 4). 

If a pair of patterns (V,V) fall in a basin of an MACA, the pattern V'(= V © V) 
falls in the 0-basin Theorem 2. V' is the hamming distance between V & V . 



3.2 MACA for Pattern Clustering 

An n-bit MACA with fc-attractors can be viewed as a natural cluster (Section 
3.1). It distributes a given set of patterns into k distinct groups. The MACA of 
Fig. 4 can be employed to classify patterns into two clusters. Cluster I is repre- 
sented by the attractor basins {00100 and 00111} while Cluster II is represented 
by rest of the basins {00000 and 00011}. As per Theorem 1, the PEF identifies 
the cluster of a pattern uniquely. The PEF yields the address of the memory that 
stores the cluster information. Therefore, Cluster I attractors yield the memory 
addresses {10, 11} while cluster II is identified by the memory addresses {00, 
01} (Fig. 5). If the MACA is loaded with an input pattern V and allowed to run 
for a number of cycles equal to its depth, it traverses through a number of states 
and finally reaches to an attractor state. The PEF of that attractor points to 
the information related to the cluster V belongs to. 



3.3 MACA for Modelling the Conventional Codebook 

The MACA based hierarchical clustering technique and the associated hierar- 
chical MACA tree structure (Fig. 6) model a conventional codebook. Each leaf 
node of the MACA tree represents a cluster of elements - the centroid of this 
cluster represents a codevector. Binary search for an input vector in the code- 
book is implemented with the traversal of MACA tree, as illustrated in Fig. 6, 
till it hits a leaf node. The leaf node represents the best match for the input. 
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Fig. 6. MACA based hierarchical tree structured pattern clustering technique. 



MACA_CODEBOOK 

Co 




Fig. 7. Structure of MACA based hierarchical clustering scheme 



4 CA Based Voice Compression 

The MACA based hierarchical clustering scheme is used to design the codebook 
for lossy compression of voice. The design consists of two steps - design of training 
set, and generation of codebook. 

Training Set: The training set for lossy compression i.e., the ‘Wave’ files, 
contains discrete signal value recorded at a sampling frequency of 8KHz. A set 
of such files are recorded from different speakers. The DCT co-efficients of these 
sample files, represented as 32-bit floating point numbers, form the training set. 
Codebook Generation: Let us consider a codebook is to be generated for the 
four clusters So, Si, S2 and S3 from a training set Co = {{So}, {Si}, {S2}, {S3}}. 
At the first level, we generate two clusters C\ = {{So}, {Si}} and C2 = 
{{S2}, {S3}} from Co- An MACA ( MACA 0 ) is designed to classify the patterns 
of Ci and C2. The similar process is next repeated for Ci and C2 to generate 
four clusters {So}, {Si} and {S2}, {S3} employing two MACA - M AC A] and 
MACA-2 (Fig. 7). A leaf node S, of the MACA tree is represented by the cen- 
troid Pi € Si of cluster S,;. The MACA tree of Fig. 7 is equivalent to binary VQ 
tree, representing a codebook. 

For test case, we need to identify the codebook entry (i.e., the codeword) 
closest to the given vector P t . At the first stage, the M AC A 0 is loaded with 
Pi and allowed to run. It returns the desired cluster Ci or C2. In next level, 
MACAi or MACA2 , depending on the results of first stage, is loaded with Pi 
to output the desired cluster S,; and its centroid. The path traversed in MACA 
tree generates a binary bit string, 0 for left and 1 for right at a node of the 
tree. For example, 00 represents the leaf node Sq (Fig. 7). This bit string is the 
compressed data. 
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Table 1. Comparison of CA based Voice Compression and GSM 



Name of 


Compression Ratio 


SNR (db) 


PSNR (db) 


NRMSE | 


Voice file 


CA 


GSM 


CA 


GSM 


CA 


GSM 


CA 


GSM 


l.wav 


5:1 


5:1 


10.52 


9.13 


17.41 


14.15 


0.30 


0.35 


2.wav 


5:1 


5:1 


10.43 


7.74 


17.05 


13.25 


0.30 


0.41 


3.wav 


5:1 


5:1 


11.58 


8.31 


19.13 


13.18 


0.26 


0.38 


4.wav 


5:1 


5:1 


11.64 


8.79 


19.30 


13.57 


0.26 


0.36 


7.wav 


5:1 


5:1 


11.49 


6.98 


18.85 


12.68 


0.27 


0.45 



Table 2. Comparative Study of Differential Cryptanalysis for CAC/DES/AES 



Input file 
size (MB) 


Avg. Std. Devi” of 
XOR distributions 
for CAC (%) 


Avg. Std. Devi” of 
XOR distributions 
for DES (%) 


Avg. Std. Devi” of 
XOR distributions 
for AES (%) 


1 


4.36 


31.95 


4.2 


2 


4.30 


30.03 


4.0 


4 


4.26 


29.05 


3.63 


6 


4.17 


28.24 


3.62 


20 


3.59 


27.67 


3.24 



For the experimentation, we have taken samples of 10 male speakers to form 
the training set. Voice data of 7 other male speakers are used to test the perfor- 
mance of our scheme. A comparison with the GSM [1] algorithm based design 
in terms of quality of the reconstructed voice, compression ratio, Signal to Noise 
Ratio (SNR) [9], Peak Signal to Noise Ratio (PSNR) [10] and Normalized Root 
Mean Square Error (NRMSE) [11] are reported in Table 1. The columns heading 
’CA’ notes the results of CA based compression scheme, and the heading ’GSM’ 
refers to the performance of GSM. The results of in Table 1 establish that the 
CA based voice compression achieves higher SNR and comparable speech quality 
as that of GSM. 

5 CA Based Cryptosystem 

The details are reported in [6]. A brief overview along with its refinement over 
that represented in [6], is next provided. 

The first level of encryption is a linear transform of the input key and also 
on plain text, the later one being implemented by rotating each byte of the 
token. At the second level, an affine transform is introduced with an additive CA 
[7] . A non-affine transformation is next implemented by employing a non-linear 
reversible CA [5,6]. Finally, a linear transform (XOR operation) is employed. 

The experimental results of Differential cryptanalysis and Shannon’s Security 
Quotient for CAC with the AES [12] and DES [13] are shown in the tables 2 and 
3. The results establish that the security level of CAC is better than DES and 
comparable to AES. Further, the CAC executes faster than the AES (Table 4). 
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Table 3. Comparative study of Shannon’s Security Quotient for CAC/DES/AES 



Input file 
size (MB) 


Shannon’s Security 
Quotient ($) 
of CAC (%) 


Shannon’s Security 
Quotient ($) 
for DES (%) 


Shannon’s Security 
Quotient ($) 
for AES (%) 


2 


14.1605 


14.2374 


14.2345 


3 


11.5527 


11.5531 


11.5706 


7 


7.5640 


7.9141 


7.6014 


8 


7.1182 


7.1468 


7.7046 


15 


5.2097 


5.3157 


5.5552 



Table 4. Comparison of execution time of software version of CAC and AES 



Input file 
size (MB) 


AES 
(in Sec) 


CAC 
(in Sec) 


1.06 


1.01 


0.47 


2.78 


2.52 


1.20 


4.46 


4.20 


2.07 


7.68 


8.67 


4.21 


23.76 


29.53 


14.61 
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Fig. 8. Architecture of Encompression decoder 



6 VLSI Implementation of Encompression Function 

The proposed architecture for Encompression, encoder and decoder, are shown in 
Fig. 3 and Fig. 8. The basic unit of the encoder is Programmable CA (PC A). The 
encompression function is incorporated in the program stored in the programmed 
memory. The programmed instruction has the following structure: 

— <CA rule vector (Field 1)>, <Number of clock cycles (Field 2)>, <Optional 
tag field (Field 3)> 

The PCA configured with the rule vector (Field 1) runs the specified (Field 
2) number of clock cycles. The optional field controls the program flow exe- 
cuting conditional branch. A complete Encompression Program is stored in a 
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Fig. 9. Universal Programmable CA (UPCA) 



programme memory. Each Programme instruction is implemented with the pro- 
posed architecture of UPCA (Universal PCA, Fig. 9). A UPCA can be configured 
with a given rule vector as a linear, additive, and non-linear CA. To execute a 
program step, the UPCA is configured by the rule vector specified in Field 1 of 
Programmed instruction. One bit index buffer (Fig. 3) is used to implement the 
MACA traversal. When the traversal follows next path, the bit value is pushed 
into the index register. We implement Control Block that generates signals to 
read the program from memory, to configures the UPCA, and run it for specified 
cycles. 

7 Conclusion 

This paper presents a CA based encompression technology of voice data. It en- 
sures efficient transmission of voice data with desired level of security. The mod- 
ular and cascadable structure of CA provides low cost high speed execution. A 
hardware architecture has been proposed for VLSI realization of encompression. 
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Abstract. In the present work, we describe a fast Motion-Planner for 
Mobile Robots. Considering robots moving with smoothed trajectories on 
variable terrains, we have developed an algorithm based on an anisotropic 
propagation of attracting potentials on a non-Euclidean manifold. The 
optimal collision-free trajectories are found following the minimum val- 
ley of a potential hypersurface embedded in a 4D space. The planner is 
very flexible: it can be use on a wide class of vehicles with different kine- 
matics and with generic shapes. Because of the latest property, it is also 
applicable to plan the movements of generic objects (e.g. in assembly 
workstations in manufacturing industry) as in Piano Mover’s problem. 

Thanks to the underlying Multilayered Cellular Automata (MCA) archi- 
tecture, it is a distributed approach. This planner turn out to be very 
fast, allowing to react to the dynamics of the world, evolving toward new 
solutions every time the environment changes without to be re-initialized. 

1 Introduction 

The work presented in this paper concerns Mobile Robots Path-Planning ex- 
ploiting Multilayered Cellular Automata. The Path-planning problem is very 
important to drive mobile robots avoiding the collisions with obstacles. In our 
work, we consider robots with different types of kinematics. The realization of 
a path-planner able to face with different types of motion is quite problematic. 
Another difficulty arise from the shape of the robot. When a robot moves around 
obstacles, its real shape and size must be considered to avoid collisions. Most of 
the work in literature handle this problem substituting the real shape with an 
equivalent shape, most of the time a circular hull enveloping the robot silhouette. 
This solution increases the space occupancy of the robot and it is feasible when 
the robot moves in wide spaces, but not in very cluttered environments. A further 
complication derives from the space geometry in which the robot is situated. In a 
flat world, as office-like structured environments, the geometry is quite simple: it 
is a planar surface (Euclidean 2D space). We want to face different situations, as 
motion on natural terrains, where the Euclidean metrics is no longer applicable. 
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This has an important consequence on the evaluation of the trajectory length, 
and on the optimality of the solutions: the metric changes from point to point 
and must be evaluated locally. During the last twenty years, many authors have 
proposed different solutions, based on geometrical descriptions of the environ- 
ment (e.g. [7,8]). In the eighties, Khatib in [4] first introduced a new method for 
the collision avoidance problem in a continuous space for a 6 DOF manipulator 
robot. This alternative approach is less precise, but more efficient: the Artificial 
Potential Fields Methods. Jahanbin and Fallside introduced a Wave Propagation 
algorithm in the discretized Configuration Space ( C-Space ) ( Distance Transform 
[3]). Barraquand et al. in 1992 [2] used the Numerical Potential Field technique 
on the C-Space to build a generalized Voronoi Diagram. Zelinsky extended the 
Distance Transform to the Path Transform [12]. Pai and Reissel in [10] intro- 
duced a Motion Planning algorithm on multiresolution representation of terrains 
using wavelets. The path finding is based on a variation of Dijkstra’s algorithm 
on a regular 2D lattice. In [5], the authors presented a Path-Planner for outdoor 
terrain based on the Control Theory and the technique of random trees (RRT). 
These methods are very precise, but the performances do not allow to use them 
on real-time systems (about 15 sec to get a path). We used CA as a formalism 
to merge the grid model of the world (Occupancy Grid) with the C-Space of a 
robot and Numerical (Artificial) Potential Field methods, with the aim to find 
a simple and fast solution for the path-planning problem for mobile robots with 
different kinematics. This method uses a directional (anisotropic) propagation 
of distance values between adjacent automata to build a potential hypersurface 
embedded in 4D space. Using a constrained version of the descending gradient 
on the hypersurface, it is possible to find out all the admissible, equivalent and 
shortest (for a given metric of the discretized space) trajectories connecting two 
configurations of the robot C-Space. 



2 Problem Statements 

One of the most important types of world models is the Configuration Space [6, 
8]. The C-Space C of a rigid body is the set of all its configurations q (i.e. poses). 
If the robot can freely translate and rotate on a 2D surface, the C-Space is a 3D 
manifold R 2 x SO(2). It can be modelled using a 3D Bitmap QC ( C-Space Binary 
Bitmap ), a regular decomposition in cells of the C-Space, represented by the 
application QC : C — > {0, 1}, where Os represent non admissible configurations. 
The C- Potential is a function U(q) defined over the C-Space that ’’drives” the 
robot through the sequence of configuration points to reach the goal pose [2] . Let 
us introduce some other assumptions: 1) space topology is finite; 2) the robot 
has a lower bound on the steering radius (non-holonomic vehicle). The latter 
assumption introduces important restrictions on the types of trajectories to be 
found. 

Cellular Automata are automata distributed on the cells of a Cellular Space 
Z™ (a regular lattice) with transition functions invariant under translation: 
f c (•) = f(-),Vc € Z”, f (•) : Ql- 4 °l where c is the coordinate vector iden- 
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tifying a cell, Q is the set of states of an automaton and Aq is the set of arcs 
outgoing from a cell to the neighbors. Robot Path-Planning Problem can be 
solved with CA in a quite simple way: every cell of the C-Space Bitmap QC is an 
automaton of a CA. The state of every cell is a vector that contributes to build 
the C-Potential U(q) through a diffusion mechanism between neighbor cells. 
The trajectories are found following the minimum valley of the surface U(q). 
We used a Multilayered Cellular Automaton [1], where each layer corresponds 
to a subset of the state vector components. Each subset is evaluated in a single 
layer and depends on the same attribute of the neighbor cells in the same layer 
and depends also on the states of the corresponding cell and its neighbors in 
other layers. In the following section, we describe each layer and the transition 
function implemented in its cells. 

3 Architecture Overview 

The Motion-Planner is organized in two major subsystems: an Input Subsystem 
and an Output Subsystem (Fig. l.a). They are subdivided in 6 sublayers (4 + 2), 
some of them are statical and others evolve during the time. The Input Subsys- 
tem is thought as an interface with the outside environment. Its layers have to 
react as fast as possible to the ’’external” changes: the robot starting pose, the 
robot goal pose, the elevation map of the terrain and, the most important, the 
changes of the environment, i.e. the obstacles movements in a dynamic world. 
The first three layers ( Starting Pose L., Goal Pose L. and Terrain L.) are consid- 
ered statical, because they are update from the outside at a very low frequency 
(lower than the internal updating frequency); the Obstacles L. is also updated 
externally, but it evolves testing the movements admissibility as described in 
section 3.1. In Fig. l.a, are shown the functional dependencies between layers, 
and the layers updating order, while in Fig. l.b, the conceptual hierarchical 
structure. The Output Subsystem returns the results of the planning, that is the 
complete set of trajectories from the Path Extraction L., or a single motion step 
from the Attraction L. 



3.1 Terrain Layer and Obstacles Layer 

The Terrain Layer contains the description of the surface on which the robot 
navigates. It is an elevation map, discretized in correspondence to the cells of 
the workspace. We consider only 2D continuous ( C 1 ) manifold embedded in a 
3D Euclidean space (e.g. Fig. 2. a). On the terrain are placed obstacles the 
robot has to avoid. The Obstacles Layer is used to map both the obstacles 
and the environmental boundary (Fig. 2.b); the combination of the two layers 
is shown in Fig. 3. In Regular Decomposition world models, the obstacles are 
decomposed in cells full or empty (Occupancy Grids) and the robot is often 
represented as a point (the robot cinematic center) moving from one free cell to 
a neighbor free one. To take into account of its real extension, the well-known 
technique of enlarging the obstacles by a given quantity (the robot maximum 
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Fig. 1. Layers Architecture: a) layers dependency graph; b) layers hierarchical structure 




Fig. 2. A simple example of: a) Terrain Layer; b) Obstacles Map Layer 




Fig. 3. Combining Terrain and Obstacles Layers 
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radius) can be used [7]. An anisotropic enlargement [7], i.e. a different obstacle 
enlargement for each robot orientation, would solve only partially the problem for 
asymmetric robots: counter-examples can be found where the robot still collides 
with obstacles due to the sweeping of its silhouette between two consecutive 
poses (Fig. 4.c). We adopted a different and more precise approach [9] to address 
this problem, defining a set of admissible robot movements in every cell. The 
admissibility of each move is influenced: 1) globally, by the specific kinematics of 
the robot; 2) locally, by the vicinity of an obstacle and by the robot shape. The 
attribute evaluated in this layer ( Obstacles Attribute) is a boolean: Obst c {6 , /r) £ 
{True, False}, representing the admissibility of each move in each robot pose. 
Therefore, it is a 4D CA and the transition function is an application: Mxl 2 x 
SO(2) —X {True, False}. 



Obst c {6, n,t+T) 



False if 3a £ A c (6, fx) : isObstacle(c + a) A (c + a) £ QC 
Obst c {0,n,t) otherwise (keep the previous value) 

(1) 



Vi > 0 ,Vc £ QC,V9 £ T>, V fj, £ M, where A c is the neighborhood of the cell c 
(e.g. in Fig. 4.d). 
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Fig. 4. Silhouette sweeping: a-b) expanded obstacle (hatched) for two robot orien- 
tations; c) a counter-example due to a coarse discretization of the orientation; d) a 
motion sweeping silhouette/cell neighborhood (hatched cells) 



As a matter of fact, this layer is decomposed in sublayers, one for each robot 
orientation and move: it is itself a Multilayered CA on a 2D domain (the space 
M 2 of positions), as previously shown in Fig. l.b. It is a particular CA also for 
another reason: each sublayer has a non-standard fixed architecture [11], i.e. the 
neighborhood does not have the standard square shape. Its shape reflects the 
motion silhouette of the robot during a movement (sweeping) as in the example 
of Fig. 4.d. In Fig. 5 is shown the evolution of this layer for a rectangular robot 
moving in the Occupancy Map of Fig. 3. The grey levels are proportional to the 
number of admissible moves, ranging from white (all moves are admissible) to 
black (no move). 
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Fig. 5. An example of the Obstacles Layer evolution for a rectangular Robot of Fig. 3. a 



3.2 Attractive Potentials Layer 

This is the core of the entire Path-Planning Algorithm. The Attraction to Goal 
Layer is defined as: Attr c (d out ) £ N 8 . It is a vector of eight values correspond- 
ing to the eight main directions. It represents the integer distance of the cell 
c from the Goal cell when the robot moves with the d ou t £ D direction along 
a collision free path. It is a digitalized representation of the C-Potential func- 
tion U(q) defined on the C- Space Bitmap. To evaluate the path length, we have 
introduced a set of costs (called metric ) for the basic robot movement: ( for- 
ward , forward-diagonal, direction-change, stop, rotation, lateral, backward, back- 
ward-diagonal). These basic movements are combined in different ways to obtain 
different robots kinematics. It is not a metric in the mathematical sense: it has 
been called metric because it defines the procedure to evaluate the trajectory 
length in the space. The robot can be subjected to non-holonomic constraints, 
therefore not every movements can be done (e.g. car-like vehicle). To face with 
different kinematics, we have introduced a subset of admissible moving directions 
D'{c, d) C D depending on the robot position (cell c) and orientation d compiled 
off-line on the base of the robot kinematics. Given the configuration ( c,d ), the 
robot can arrive with a move p to a configuration ( 7 , <5) , V 7 £ /' (d) , V<5 £ D'( 7 , d ) , 
where /'(d) C I c is the subset of neighbor cells reachable from the configuration 
(c, d), and £>'( 7 , d) C D is the subset of admissible orientations at the destination 
cell. The transition function is defined as follows: 



Attr (c, d,t+ 1) 



1 if c = goal -.cell A d = final -direction 

MinAttr (c, d, p, t) if 3p £ M : Obst M (c, d, t) A 

A Attr (c, d, t) ^ MinAttr (c, d, p, t) 
Attr (c, d, t) otherwise 



(2) 



Vi > 0,Vc £ (?C,Vd £ D, where: 

MinAttr(c,d,p,t) = miny^gM {Attr ('y(p), S(p), t) + Cost^ (c, d)} and 
Costfj,(-,-) is the length/cost of the move p from the current configura- 
tion (c, d) to the next configuration ( 7 , 6), and Obst ^ (c, d, t) is the admissibility 
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value of the same move generated in the Obstacle Layer. Changing the metric 
(movement costs), we can realize robots with different types of kinematics. 
For example, the kinematics (2, 3, 1, 0, High , High , High , High) emulates a car- 
like kinematics moving only forward (Dubin’s vehicle), while the kinematics 
(2, 3, 1,0, High, High, 2, 3) emulates a common car-like kinematics moving also 
backward (Reed’s and Slrepp’s vehicle). A robot also translating in any direction 
(omnidirectional) has a kinematics like: (2, 3, 1, 0, 1, 2, 2, 3). Two main properties 
can be demonstrated: the termination of the propagation and the absence of 
local minima. The later property is very important: it is possible to achieve the 
goal (the global minimum) just following the negated gradient vector without 
stalling in a local minimum. The cost function Cost^(-,-) up to now was 
depending exclusively on the move p ( costal )). The admissibility of the move 
p was depending only on the robot pose (c, d) , because of the presence of the 
obstacles distributed around the robot. This is correct on a flat environment 
(Euclidean space), where the distance between cells does not depend on the 
position (the space metric is constant), but it is not correct any more for variable 
terrains, where there is a vertical component too. In this case, the cost must 
depend on the 3D distance between points on the surface, i.e. on the difference 
of elevation between two consecutive positions: it is necessary to include the 
gradient of the surface in the cost function. To make a fast computation, it 
is usual to approximate the gradient value with the absolute difference, thus 
obtaining: Costae, d) = cost(p) + \ Elev(c,d) — Elev('y(p), 6 (h)) |, where the 
first term is the horizontal component and second is the vertical component of 
the displacement. The graphical representation of the potential hypersurface 
(4D) built over the terrain surface is quite difficult. To realize its aspect, we 
must project it from the 4D space to a 3D space, obtaining the skeleton laying 
on the hypersurface, which gives an approximate idea of its topology [9]. This 
skeleton is a tree, with the root in the goal cell, and where the links are the 
admissible robot movements that connect one cell to each other. 



4 Algorithm Properties 

This algorithm has the advantage of being computable in an asynchronous way: 
there is no specific evaluation order of the cells in each layer. Another important 
property is related to the consistency of the solution found. For a given terrain 
surface, the solution found (if it exists) is the set of all shortest paths that connect 
the starting cell to the goal cell. The CA evolution can be seen as a motion from 
one point to another point of the global state space until an optimal solution is 
reached. This is a convergence point for the given problem or a steady global 
state. If we make some perturbations, such as changing the environment (e.g. 
moving an obstacle) or changing the goal cell, then the point becomes unstable 
and the CA starts to evolve again towards a new steady state, finding a new set 
of optimal trajectories. This property is called Incremental Updating. The CA 
spontaneously evolves to a new steady state from the previous one, without to 
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Fig. 6. An example with a car-like kinematics: a) Attraction Potentials Hypersurface 
(3D skeleton); b) Paths projected on the 2D robot workspace; c) Trajectories on the 
elevation map 



be reset and re-initialized, therefore realizing a Reactive Path-Planner: a path- 
planner that reacts to external changes. 

5 Experimental Results 

We have generated some synthetic elevation maps and introduced an obstacle 
distribution to observe the algorithm behavior. An interesting property of this 
algorithm is the simultaneous computation of trajectories from more than one 
starting position. We exploit this property to show multiple problems in the same 
environment. In the example of Fig. 6, the terrain has a group of three ” hills” 
in the middle, and we consider five different starting points of the robot and one 
goal (bottom-left). From any position, the robot tries to move around the hills, 
unless the shortest path is to pass over the hills (in any case, at the minimum 
elevation). In Fig. 7 an U-shaped robot, with omnidirectional kinematics, has to 
be inserted around an obstacle, with three possible starting positions. To avoid 
obstacle collision, it has to maneuver around the obstacles in the narrow passages 
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Fig. 7. An example with an omnidirectional kinematics: a) Attraction Potentials Hy- 
persurface (3D skeleton); b) Paths 3D Skeleton; c) Paths projected on the 2D robot 
workspace; d) Trajectories on the elevation map 



to arrive correctly aligned to the final position. In this case, the problem is quite 
constrained due to the obstacles next to the goal, and quite close to each other. 

The performance tests, carried out with an Intel Pentium IV 2.26 GHz PC, 
gave the following results: 0.18 s (Fig. 6); 0.89 s (Fig. 7). These mean times 
are evaluated over 100 iterated experiments for each problem and include the 
initialization phase of each test. The complexity of a path-planning algorithm 
is always strictly related to the obstacles distribution. It is quite impossible 
to evaluate it because of the huge number of possible situations. We can find 
only a good upper-bound estimate. In the worst cases, the longest paths cover 
nearly 1/2 of the total number of cells N of the Workspace Bitmap , and require 
nearly 2^-N cells updates to be computed. Thus a realistic upper-bound of the 
complexity is 0(N 2 ). Taking into account the obstacles enlargement, the result 
is even better since the number of free cells is lower. 
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6 Conclusions 

In this paper we have described a solution to the Path-Planning Problem for a 
Mobile Robot with a generic shape (user defined) with a generic kinematics on 
variable (regular) terrains with a Cellular Automata approach. Some properties 
of this work can be highlighted: 1) CA is a good formalism when Decomposi- 
tion Models (as Occupancy Grids) are used to represent the environment; 2) it 
is complete: it generates all the collision-free trajectories (or determines if no 
solution exists); 3) the algorithm is quite flexible and can be used with different 
types of kinematics (just changing a set of weights) and with robots of different 
shapes on variable terrains; 4) it is reactive: it is fast enough to allow an incre- 
mental updating of the trajectories every time environment changes are detected 
by the sensors. 
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Abstract. Laser dynamics is traditionally modeled using differential 
equations. Recently, a new approach has been introduced in which laser 
dynamics is modeled using two-dimensional Cellular Automata (CA). In 
this work, we study a modified version of this model in order to simulate 
the dynamics of pulsed pumped lasers. The results of the CA approach 
are in qualitative agreement with the outcome of the numerical integra- 
tion of the laser rate equations. 



1 Introduction 

A laser device generates or amplifies electromagnetic radiation based on the 
phenomenon of stimulated emission. It is basically composed of: 

1. A laser medium: an appropriate collection of atoms, molecules, ions or a 
semiconductor crystal (we will refer to these elements in general as ’’atoms”). 

2. A pumping process that excites electrons from those atoms to upper energy 
levels, due to some external electrical, optical or chemical energy source. 

3. Optical feedback elements that reflects repeatedly the radiation beam into 
the laser medium (in a laser oscillator), or allow it to pass only once through 
it (in a laser amplifier). 

The working principle of laser is stimulated emission: an excited atom can 
decay to a lower state stimulated by the presence of a photon with energy equal 
to the difference between the two energy levels, emitting a second photon with 
the same frequency and propagating in the same direction. The process of ab- 
sorption has the same probability, so stimulated emission dominates only when 
a population inversion is induced in the material by some pumping mechanism. 

A simplified but still realistic model of many real lasers is the four-level laser 
system shown in Fig. 1. The population dynamics of a laser (the variation with 
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Fig. 1 . Schematic view of a four-level laser system. 



time in the number of laser photons and in the population inversion, or number 
of electrons in the upper laser level minus the number of electrons in the lower 
laser level) is usually described as a system of coupled differential equations 
called laser rate equations. 

The rate equations can be put in its simplest form in the case in which 
the life times of electrons in levels E\ and E$ are negligible compared to the 
life time of electrons in level E 2 . Then the electron population of level Ei is 
Ni ~ 0, so the population inversion is approximately equal to the upper laser 
level population N(t) = N 2 (t) — Ni(t) cs N 2 (t ); and the absorption of laser 
photons by electrons at level E\ is negligible. Additionally, the pumping into 
level E 2 can be described by a simple pumping rate R , which is constant for 
many lasers. For a pulsed laser, the pumping is time dependent, R = R(t). 

Under these assumptions, we can take into account only levels Eq and E 2 , 
and write a simplified form of the laser rate equations including the spontaneous 
emission process, which is still realistic to describe some dynamic laser behavior 
-such as relaxation oscillations or gain switching-, as [1,2]: 



dn(t ) 
dt 

dN(t) 

dt 



AT/ , . . n(t) N(t ) 

A N(t) n(t) + e 

Tr. Ta 



m 



® - KN(t)n(t ) 



(1) 

(2) 



Equation (1) gives the variation with time on the number of laser photons, 
n(t), which is related to the laser beam intensity. The first term on its right 
hand side, +KN(t)n(t), represents the increasing on the number of photons 
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by stimulated emission ( K is the coupling constant between radiation and the 
population inversion). The next term, — n(t)/r c , introduces the decaying process 
of laser photons inside the cavity and r c is the decay time of the laser photons. 
The term +c7V(f) /r a represents the spontaneous emission process, where e is 
the fraction of inversion decay processes which are radiative and for which the 
emited photon has the same frequency and direction of the laser radiation and 
T a is the decay time of the population inversion. This contribution was not 
taken into account in our previous study [3]. Here, this term has been taken 
into account in order to get more precise results and to be able to integrate the 
differential equations (1) and (2) with the initial conditions n(0) = 1V(0) = 0, 
because otherwise the solution for the number of photons is always n(t) = 0. 

Equation (2) represents the temporal variation of the population inversion, 
N(t). The term +R(t) introduces the pumping of electrons with a pumping rate 
R to the upper laser level. The term — N(t)/r a represents the decaying process 
of the population inversion with the characteristic time r a . Finally, the product 
term —KN(t)n(t) accounts for the decreasing of the population inversion by 
stimulated emission. 

In this work, the dynamics of a pulsed pumped laser is explored. We suppose 
that the pumping has a pulsed form of width 2 t p which is greater than the life 
time t a of the population inversion. The time dependence of the pumping rate 
that has been used is: 



m = 



Rm ^(t), 

0 ; 



0 < t < 2t p 
t < 0, t > 2 t p 



( 3 ) 



where R m is the maximum value of the pumping rate at the time t p and 



\ [! -cos(^)] 



( 4 ) 



The value of R(t) is represented in Figure 2 with the legend ’’Pumping”. 



The purpose of this work is the study of the CA approach to model the 
dynamics of pulsed lasers and the comparison with the results obtained by the 
corresponding laser rate equations. 



2 CA Model 

We have recently introduced a CA model for simulating laser dynamics, which is 
described in reference [3]. Further details are discused in references [4,5]. This CA 
laser model has a similar nature to other models which have been used to describe 
different reactive phenomena, such as the Belousov-Zlrabotinsky reaction [6] or 
the population dynamics of prey-predator systems [7] . In order to simulate the 
behavior of pulsed pumped lasers, two main modifications from the original 
model have been introduced in this work: 
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i) The pumping probability here is time dependent, whereas in the original 
model it was constant. 

ii) The spontaneous emission process is associated with the decaying of the 
population inversion. This is more realistic than in the original model, where 
this was introduced as a random level of noise photons. 

The laser system is modeled by a 2-dimensional square lattice of 400 x 400 
cells with periodic boundary conditions. The lattice represents a simplified model 
of the laser cavity. 

2.1 States of the Cells 

Two variables a,(f) and d(t) are associated to each node of the lattice. The 
first one, a,;(f), represents the state of the electron in node i at a given time 
t. An electron in the laser ground state takes the value a l (t) = 0, while an 
electron in the upper laser state takes the value a,i(t) = 1. A temporal variable 
di(t) £ {0, 1,2, ...,r a } is also introduced, in order to take into account the finite 
time, r a , that an electron can remain in the upper state. If the electron is in the 
base state d;(f) = 0, otherwise di(t + 1) = fij(f) + 1 until the maximum value t q 
is reached and then di(t + 1) = 0. 

The second variable C;(t) £ {0, 1, 2, ..., M} represents the number of photons 
in node i at time t. A large enough upper value of M is taken to avoid saturation 
of the system. There is also another temporal variable, c* (t) £ {0, 1, 2, ..., r c }, 
which measures the amount of time since a photon j £ {1, 2, ..., M} was created 
at node i. t c is the life time of each photon. For a given photon j, cf (f + 1) = 
c- (t) + 1 until the life time r c is reached and then c? (f + 1) = 0. 

2.2 Neighborhood 

The Moore neighborhood has been used. Nine neighbors are taken into account 
to compute the transition rules of any given cell: the cell itself, its four nearest 
neighbors and the four next neighbors. 

2.3 Transition Rules 

The transition rules determine the state of any particular cell of the system at 
time t + 1 depending on the state of the cells included in its neighborhood at 
time t. The evolution of the temporal variables cq(t) and c] (t) was described 
beforehand. Here we describe only the evolution of cq(t) and Cj(t). 

— Rl. If Oj(t) = 0 then 

<Zj(t + 1) = 1 with probability A (t) 

where A (t) = X m <j>(t) and A m is the maximum value of the pumping 
probability A (t). 
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— R2. If 0,i(t ) = 1 and 7i(t) = E neighbors c * W > S then 

j Ci(t + 1) = Ci(t) + 1 
\ di(t + 1) = 0 

where R is the number of photons included in the neighborhood of the cell 
i, and 6 is a threshold value, which has been taken to be 1 in our simulations. 

— R3. If Ci(t) > 0 and there is one photon j for which c? (t) = r c then 

Ci(t + 1) = Cj(t) - 1 

— R4. If cii(t) = 1 and a,;(f) = r a then 

/ cii(t + 1 ) =0 

\ci(i + l) = Ci(t) + 1 with probability 9 

These transition rules represent the different physical processes at the 
microscopical level in a laser system. Rule R1 represents the probabilistic 
electron pumping process. Rule R2 models the stimulated emission: if the 
electronic state of a cell has a value of ai(t) = 1 and the number of laser photons 
in the neighborhood is greater than a certain threshold, then at the time t + 1 a 
new photon will be created in that cell and the electron will decay to the ground 
level. Rule R3 represents the photon decay. Rule R4 represents the electron 
decay in a similar way to rule R3: after a time r a of the excited electrons, these 
electrons will decay to the ground level. This rule represents both radiative and 
non-radiative processes: a fraction 9 of the electron decay processes involves 
creating a new photon in the same cell. This models the spontaneous emission 
with a probability 9. As in an ideal four level laser the population of level E\ is 
negligible, stimulated absorption has not been considered. 



3 Results 

The expected behaviour of a pulsed pumped laser can be otained by integration 
of the differential equations (1) and (2). The initial conditions that have been 
used are: ?r(0) = 1V(0) = 0 and the values of the parameters are: r a = 45, 
t c = 3, £ = 10 -5 , K = 6 • 10 -6 . The pumping pulse parameters are: R m = 8000, 
t p = 2000. Here r a , r c and t p are measured in time steps , K and R rn in 
( time steps)- 1 . 

In this study, the values of r 0 , r c and t p have been chosen so that r Q and r c 
are much smaller than t p . In this regime, usually referred to as quasistationary 
pumped laser, the solutions of the differential equations consist of a laser emission 
pulse which follows the pumping pulse, and essentially no relaxation oscillations 
appear. Figure 2 shows the results obtained by the integration of equations (1) 
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0 500 1000 1500 2000 2500 3000 3500 4000 

Time steps 

Fig. 2. Evolution of the system obtained by numerical integration of the differential 
equations, using the values of the parameters indicated in the text. The y-axis values 
have been normalized relative to the population inversion of the central plateau. The 
curve with the legend ’’Pumping” is the value of R(t). 



and (2). In the initial phase, the population inversion N(t) increases in response 
to the pumping, but until a threshold pumping is reached, stimulated emission 
is not important and very few laser photons are created. When this threshold is 
reached, the laser process is activated and the number of laser photons n(t) starts 
to increase. When the pumping is over the threshold value, stimulated emission 
reduces the population inversion, so N(t) reaches a maximum. After that, in the 
differential equations solutions N(t) reaches a stable value and n(t) increases 
towards a maximum, to decrease again, following the pumping pulse. Finally, 
when the pumping decreases down the threshold value, stimulated emission is 
no longer important so n(t) decays to zero. After that, N(t) decreases follow- 
ing the tail in the pumping, to reach a zero value when the pumping falls to zero. 

For pulsed pumped lasers the dynamics of n(t) and N(t) are strongly 
dependent on the pumping function R(t). So in order to compare the output of 
the laser rate equations with the CA we must stress that the pumping process 
has a probabilistic nature following rule R1 for the CA, whereas for the laser 
rate equations the pumping is a smooth function. Then a comparison between 
both outputs should be made only from a qualitative point of view, because the 
pumping is not exactly the same. 

A typical result by the CA simulation is shown in Figure 3. For this 
simulation A m = 0.1 and 9 = 0.01. Initially there is no laser photon and all 
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0 500 1000 1500 2000 2500 3000 3500 4000 

Time steps 



Fig. 3. Evolution of the system obtained by running a simulation using the cellular 
automata model, for values of the parameters equivalent to those of Figure 2. The 
y-axis values have been normalized relative to the population inversion of the central 
plateau. The curve with the legend ’’Pumping” is the number of electron cells which 
are pumped by rule R1 in each time step. 



the atoms are in the ground state (dj(0) = 0, Cj(0) = 0). The results of the 
CA simulation reproduce in a qualitative way the behavior exhibited by the 
solutions of the differential equations. It is remarkable that the CA model 
reproduces the plateau in the population inversion, which is one of the main 
features predicted by the equations: after the laser is above threshold, the 
population inversion remains approximately constant. Another feature which is 
reproduced by the CA model is the peak in the population inversion when the 
threshold pumping is reached. A similar -but smaller- peak appears in the CA 
simulation in the phase in which the pumping is decreasing, after the pumping 
threshold is surpassed. This peak does not appear in the equations solutions, 
but has a direct physical interpretation: after the stimulated emission decays to 
zero, it doesn’t contribute to reduce the population inversion, so there is a net 
increase in N(t), until it decreases again due to the fall in the pumping. 

Finally we have found that the results from the integration of equations (1) 
and (2) are not very sensitive on the values of the parameter e. By running 
different simulations changing the value of the equivalent parameter in the CA 
model ( 6 ) , it can be observed that the results are not very sensitive on this value 
either, in good agreement with the differential equations behavior. 
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4 Conclusions 

Cellular automata models like the one used in this work have an interesting 
heuristic nature, which helps to understand the physical phenomena being mod- 
eled, in a more direct way than the differential equations [8,9]. In addition, this 
kind of models can be useful as an alternative modeling tool for laser dynamics 
when numerical difficulties arise, for example in lasers governed by stiff differen- 
tial equations, with convergence problems. Due to their intrinsic parallel nature, 
they can be implemented in parallel computers and offer a great advantage in 
computing time [10,11]. 

In this work, we report a work in progress involving modifications in a previ- 
ously introduced cellular automata model of laser dynamics, in order to simulate 
pulsed pumped lasers. We have compared the results obtained by the numeri- 
cal integration of the laser rate equations for a quasistationary pulsed pumping, 
with those resulting from a simulation by a cellular automata model of this 
type of laser. The results of the CA model reproduce in a qualitative way the 
behavior exhibited by the numerical integration of the equations. After these 
preliminary results, more work will be done in order to compare both results in 
a more quantitative way. 
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Abstract. We study the structural evolution of a growing thin film 
on time scales in the order of seconds and even minutes. This requires 
solving the problem of bridging large time and length scale gaps in simu- 
lating atomistic processes during thin film deposition. We describe a new 
simulation approach inspired by lattice gas cellular automata to address 
this problem. The approach is based on a discrete description of atoms 
so that the unit length scale coincides with the atomic diameter. For 
homoepitaxial thin film deposition, the local driving force is the propen- 
sity of an atom to establish as many chemical bonds as possible to the 
underlying substrate atoms when it executes surface diffusion. The in- 
teraction between atoms is defined using a coarse-grained approach to 
boost the computation speed without heavily sacrificing the atomistic 
details. Simulation results of Si layers deposited on a Si(001) substrate 
are presented. 



1 Introduction 

Epitaxial thin film deposition represents a system that has a large number of 
degrees of freedom. Per unit centimetre, there are about 10 14 surface atomic 
sites for deposited atoms to stick to. To deposit one epitaxial layer of atoms 
onto a substrate, which are in a complete lattice registry with the underlying 
substrate atoms, deposited atoms perform motions on the substrate’s surface in 
order to find equilibrium (lowest potential energy) sites. Microscopically, these 
motions can be broken down into elementary hops, from one atomic site to the 
next, which are driven by the surface energy landscape in addition to energetic 
barriers separating the initial site from the final site. Various interactions that 
follow from the ever-changing landscape due to the simultaneous movements of 
many atoms can be obtained by quantum mechanical simulations. However, the 
huge number of degrees of freedom involved has so far prevented researchers 
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from performing complete multiscale simulations of thin film deposition using 
quantum mechanics [1]. 

We know that the collective motion of these atoms gives rise to a macro- 
scopic surface diffusion current and to a surface morphology. It thus remains 
important to predict the surface morphology based on a reduced microscopic 
picture. Lattice gas cellular automata (LGCA) are a potential method capable 
of coarse-graining the microscopic picture of atomic-scale motions to a manage- 
able computational complexity. 

This paper describes the formulation of LGCA-based computation to sim- 
ulate atomistic processes that make up thin him deposition. It is inspired by 
the LGCA models for chemically reacting systems [2]. The unit length scale 
for our simulation coincides with the atomic diameter and the unit time scale is 
matched to the collective diffusion time constant. The LGCA evolution is defined 
through local rules for the motions of atoms, which are based on the propensity 
of each atom to make as many chemical bonds as possible in order to attain its 
equilibrium state. 



2 The Lattice-Gas Cellular Automaton Description 

Consider an LGCA on a one-dimensional periodic lattice of length L. The lattice 
sites are denoted by r = 0, 1, . . . ,L — 1 ( mod L ) and the discrete time points by 
k = 0,1,.... The state of a site r at time k is given by the integer- valued variable 
h(r, k ) representing the sites Him height, i.e. the number of one atomic-height 
layers. Periodicity of the lattice implies h(r + jL, k) = h(r, k) for all j £ Z. We 
initialise the LGCA with values h(r, 0) for all r at time k = 0. At time k = 0, 
we also initialise the pseudo-random number generator employed to simulate 
random events like the determination of landing sites of Si atoms, see Sect. 3. 

If h(r, k) > 0 then the particle at height h(r, k) is called the adatom at site 
r at time k: all deposited particles at site r below the adatom are called bulk 
atoms. If h(r, k ) = 0 then no adatom is located at site r at time k. We assume 
that all adatoms and only these, are mobile and can move to neighbouring lattice 
sites with probabilities to be computed. The rules that govern this motion will 
be described below. The considerations above imply that the energy of a particle 
does not dissipate with time and that a particle is always mobile as long as it is 
not buried by another one. Since adatoms are always mobile we consider them 
not part of the deposited structure. 

The time evolution of the LGCA from time k to time k + 1 proceeds in 
three sub-steps: (i) particle landing, (ii) adatom velocity computation, and (iii) 
adatom motion. Each sub-step is performed at each lattice site independently 
from the other lattice sites (distributed execution) and the next sub-step is 
started only after the previous sub-step has been completed on all sites of the 
lattice (synchronisation). We introduce the intermediate time steps k! and k" 
which are attained when the first and the second sub-step, respectively, are 
completed. The state of the system can only be observed at times k, i.e. after 
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completion of the third sub-step. We describe the three sub-steps in more detail 
below. 

In order to connect the LGCA model to a real surface growth process, we 
must relate the simulation time k, i.e. the number of simulation steps performed, 
to the real time t (in seconds [s]). We assume that each simulation step from 
simulation time k to k + 1 corresponds to an effective time step At e g ([s]) inde- 
pendent of k and this leads to t = fcZ\t e g. A reason for the fc-independence and 
the precise definition of At e s is given in the end of this section. 

In the description below we make use of the indicator function 1 [ • ] , where 
the argument is a logical expression and \[true\ = 1 and 1 [false] = 0. 



Particle Landing. The deposition of particles on the surface is modelled by 
the particle landing sub-step. We assume a constant and spatially homogeneous 
particle flux, from which particles are randomly, uniformly in space and time, 
deposited on the surface, leading to a growth rate R g (in monolayers per second 
[ML/s]) of the structure. We describe a site- and a lattice-based approach of this 
sub-step and give reasons why we prefer and use the latter in our simulations. 

Site-based approach: Particles land at each lattice site with the particle landing 
probability (per unit simulation time and lattice site) pi € [0, 1] derived from the 
growth rate and the effective time step as 

Pi — RgAt e ff . 

If a particle lands on the surface at site r then the height h(r, k) increases by 
unity. Let {£/(r, k) | r = 0, 1, . . . , L — 1} be a set of independent and identically 
distributed Bernoulli random variables with probabilities P[£;(r, k) = 1] = Pi, 
then we obtain for all r = 0, 1, . . . , L — 1 

h(r,k') = h(r,k) + (i(r,k) . (1) 

Computational concerns regarding the site-based approach: This approach re- 
quires L Bernoulli trials for each execution with, in general, a very small landing 
probability pi. This raises three concerns. 

1. In view of the many time steps necessary for a simulation, the generation of 
L random numbers for the L Bernoulli trials is computationally very costly. 

2. The small probability pi is often even too small to perform the Bernoulli 
trials faithfully with the available range of random numbers, see Sect. 3. 

3. Additionally, in view of the very small probability pi, it appears to be a waste 
of CPU time to perform the Bernoulli trials for each lattice site indepen- 
dently. In fact, the probability that, during one execution of the site-based 
particle landing sub-step, two or more particles will land on the surface is 
given by P [ two or more particles land ] = 1 — (1 — pi) L — Lpi(l — pi) L ~ x , 
and is extremely small for typical values of L and p;, see Fig. 1 (left). 
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Fig. 1. P [ two or more particles land ] for various values of the lattice size L and a 
range of typical landing probabilities pi (left) and the effective time step At e s for 
typical values of corrugation energy E c and growth temperature T (right) 



Lattice-based (coarse-grained) approach: Based on the concerns and arguments 
presented, we propose a second approach of defining the particle landing sub- 
step. This approach is computationally more efficient and for suitable values of 
L and pi physically sound. We assume that at most one particle can land on 
the surface in each execution of the particle landing sub-step. Let (i{k) be a 
Bernoulli random variable with probability of success Lpi and £ s (k) be a uni- 
formly distributed random variable with range {0, 1,2, . . . , L — 1}. Then (i(k) 
decides whether a particle will land on the surface in the execution of the sub- 
step and ( s {k) selects the particular landing site, i.e. 



h(r h , n = / Mg k ) + : if r = £ s {k) 

' ’ ' \ h(r , k) : otherwise 



(2) 



This lattice-based approach requires only the generation of at most two random 
numbers (improved efficiency), the used probability of success in the Bernoulli 
trial is larger than pi (improved reliability), and on average the same number of 
particles are deposited on the surface (maintained consistency) . In the following 
we consider exclusively the lattice-based particle landing sub-step. 



Adatom Velocity Computation. The adatoms on the surface undergo sur- 
face diffusion depending on the surface morphology. Therefore we will define for 
each lattice site r the adatom velocity v(r, k") € {—1, 0, 1} for the adatom cur- 
rently located at this site. This velocity determines the motion of the adatom at 
site r to sites r — 1 , r, or r + 1 , respectively, in the subsequent adatom motion 
sub-step. Note that we allow for an adatom to stay at its site. 

The velocity v(r, k") is computed as a function of the current height at site 
r and neighbouring sites. To do this we derive below (diffusion) probabilities 
p~{r, fc'), p°(r, k '), and p + (r, fc'), with p + (r, k') +p°(r, k') +p~(r, k') = 1, for the 
three motion options. This allows to define the set {£„(r, k') | r = 0, 1, . . . , L — 
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1 } of independent random variables with range {—1,0,1} and corresponding 
probabilities p~(r, k'), p°(r, k'), and p + (r, k'). We then compute 

v(r, k") = 1 [h(r,k') > 0 }£ v (r,k') . (3) 

Eq. (3) implies that the velocity associated with site r is zero if there is no 
adatom. Hence, we consider in the following the derivation of the diffusion prob- 
abilities p + (r, k'), p°(r, k'), and p~(r, k') in the case h(r, k') > 0. The computa- 
tion of the diffusion probabilities is based on the number of free bonds which the 
adatom at site r currently has, b° (r, k') , and the number of free bonds it would 
have if moved one site to the right or left (leaving everything else unchanged), 
b + (r,k') and b~(r,k '), respectively. An adatom always has a free bond to the 
top and possibly free bonds to the left and to the right. We obtain 

b°(r, k') = 3 — 1 [h(r + 1, k') > h(r, k') — 1 ] — 1 [h(r — 1, k') > h(r, k') — 1 ] , 

b + (r, k') = 3 — 1 [h(r + 2, k') > h(r + 1, k') ] — 1 [h(r, k') — 1 > h(r + 1, k ') } , 

b~(r , k') = 3 — 1 [h(r, k') — 1 > h(r — 1, k') ] — 1 [h(r — 2, k') > h(r — 1, k') ] . 

In order to derive the expression for b + (r,k') (a similar consideration holds for 

b~(r, k')) denote by h r ,+ (-, k') the surface profile which is obtained from h(-,k') 
if the adatom at site r’ is moved to site r' + 1, i.e. 

( h(r' + 1, k') + 1 : if r = r' + 1 
h r ,+ (r, k') = < h(r', k') — 1 : if r = r' 

[ h(r , k') : otherwise 



Now b + (r , k') can be expressed as b°(r+ 1, k') with h(-,k') replaced by h r,+ (-, k'). 
Note that the bond counts take the presence of nearby adatoms into account 
which is an important distinction of the LGCA approach from, e.g., kinetic 
Monte-Carlo simulations that consider diffusion of only one adatom at any time. 

The rate that an adatom with b\ free bonds at its current site will hop to a 
site where it would have 62 free bonds is given by 



R(bi, bf) 




E c 

k B T 



E d 

k B T 



(62 




In this expression, E c is the corrugation energy (in Joule [J] ) , Ed is the unit 
broken (dangling) bond energy ([J]), two material properties, T is the growth 
temperature (in Kelvin [K]), k B is the Boltzmann constant (in Joule per Kelvin 
[J/K]), and hp is the Planck constant (in Joule x second [Js]). We see that if 
62 — &i < 0, i.e. when the adatom has less free bonds if moved to the new site, 
that is it achieves a more stable state there, then the rate R(bi,b- 2 ) becomes 
large. In particular, R(bi, 62) is increasing if 62 — b± is decreasing. The expression 
above allows to define the rates R + (r,k'), R°(r,k'), and R~(r,k') at which an 
adatom currently at site r is moving to site r + 1 , staying at site r, or moving 
to site r — 1 , respectively, by 

R+’°--(r, k') = R(b°(r, k'), b + '°’~(r, k')) . 




Surface Roughening in Homoepitaxial Growth 291 



Finally, by normalising these rates with respect to their sum R + (r,k') + 
R° (r, k') + R~ ( r , k ') , we obtain the diffusion probabilities p + (r, k'),p° (r, &/) , and 
p~(r, k’). Note that the corrugation energy E c has no influence on the diffusion 
probabilities; it scales out. 

The following observation is important for computational efficiency of the 
LGCA simulations as it avoids the excessive evaluation of the exponential func- 
tion. The diffusion probabilities p + (r, k'), p~(r , k’), and p°(r, k') depend on the 
differences b + (r, k') — b°(r, k') and b~(r, k') — b°(r, k') only and these differences 
only take values in {— 2, — 1, 0, 1, 2}. Hence, we define matrices P + ,P~ £ R 5 ' 5 
such that p + (r, k') = P^, p~(r , k') = P 1“ and p°(r, k') = 1 - (P£ + P^), where 
i = b + (r, k') — b°(r , k') + 3 and j = b~(r , k') — b°(r, k') + 3. These matrices are 
independent of the simulation time k and, hence, can be set up once and forever 
at the start of the simulation. 



Adatom Motion. The adatoms on the surface are moved to new locations 
according to the velocities calculated in the adatom velocity computation sub- 
step. This sub-step guarantees that v(r, k") = 0 if there is no adatom at site r 
and time k" . The height h{r , k) evolves according to the formula 

h(r,k+ 1) = h(r, k') - 1 [v(r,k") ± 0] 

+ l[v{r — 1, k") = 1 ] + l[i>(r + 1, k") = — 1] . ( ] 

Note that two adatoms may move to a lattice site r in this sub-step. In such a 
case they will be ou top of each other. 



The Effective Time Step At e ff. As pointed out earlier, we relate the simula- 
tion time k to the real time t by the effective time step At e q. The time span At e g, 
equivalent to evolving the LGCA from k to k + 1, corresponds to the average 
time taken by adatoms to jump to a nearest atomic site (including the waiting 
time before the actual jump). Hence, the At. e g corresponding to simulation step 
k should depend on the surface morphology after simulation step k and therefore 
can be computed only after that step has been completed. However, we need 
At e ff within the particle landing sub-step of the kth simulation step. Because of 
this difficulty (which could be avoided by making the particle landing sub-step 
the final sub-step of each simulation step) and, more importantly, for computa- 
tional efficiency we decided to define At e g independent of k and with respect to 
a flat surface as follows. The average time required for au adatom to jump to 
a nearest atomic site is the inverse of the particle hop rate 7?(£>i , 62) • Since we 
assume that the adatom diffuses on a flat surface we have bi = &2 and obtain 



At 



-1 

eff 



k B T 

hp 



exp 




The dependence of At e fi on growth temperature T and corrugation energy E c is 
illustrated graphically in Fig. 1 (right). 
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3 The Simulation of Random Events 

Random events are an integral part of LGCA simulations. We simulate these 
events using a sequence of uniformly distributed (pseudo-) random integers from 
the set {0, 1, 2, M max }, where M max = 2 32 — 1, generated by the pseudo- 
random number generator described in [3] with an initialisation as in [4]. The 
generator is a combination generator which combines a subtract-with-borrow 
generator with a simple Weyl generator. The period of the combination generator 
is not known but, following [3], should be longer, or at least not shorter, than 
the period of any of its parts. The subtract-with-borrow generator has a period 
of about 10 414 . In the end of this section we will bound the required number of 
random integers for an LGCA simulation which should be less than the square 
root of the period of the generator. This condition is satisfied for the simulations 
to be performed. 

The LGCA described requires to simulate three types of random variables 

1. a uniformly distributed random variable £ m with range {0, 1, . . . ,M — 1}, 

2. a Bernoulli random variable £ p with probability of success p £ [0, 1], and 

3. a random variable £± with range {—1,0, 1} and corresponding probabilities 
p + ,p~ £ [0, 1], and p° = 1 — p + — p~ £ [0, 1]. 

We describe how we simulate these random variables in the following and also 
discuss further implementation details. 

Uniformly Distributed Random Variable • We take the next integer n 
from the sequence of uniformly distributed random integers and define = n 
mod M, which is a good approximation if M -C M max . 

Bernoulli Random Variable . We take the next integer n from the se- 
quence of uniformly distributed random integers. To determine the success of a 
Bernoulli random variable with probability of success p , we can check whether 
n/(M max + 1) < p. This computation involves floating point operations (divi- 
sion and comparison) which are computationally more expensive than the corre- 
sponding integer operations. Therefore, especially in view of the many Bernoulli 
trials to be performed with the same probability p, we first and forever define 
p := round (p • (M max + 1)) £ {0,1,2, . . . , M max + 1} and then set £ p = l[n < p ]. 
This involves only one integer operation for each evaluation of £ p . 

Here we remark that due to the finite range of random integers provided 
by the generator we are not able to generate random variables with 0 < 
p < 1/(2 (M max + 1)) ss 1.2 • 10 -1 ° because all these result in p = 0. Therefore 
we recommend to consider only random variables £ p with p = 0 or p > 10 -8 
which should be faithfully simulated with the available sequence of uniformly 
distributed random integers (cf. the discussion in the description of the particle 
landing sub-step in Sect. 2). Similar considerations apply for random variables 
with p close to 1 or if we want to distinguish between random variables £ p 
and £ p where p and p are very close. 
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Random Variable £-|-. To simulate the random variable £± we follow the 
approach to simulate Bernoulli random variables. We define two integers p + := 
round (p + • (M max + 1)) and p~ := round((p + + p~) ■ (M max + 1)) and take the 
next integer n from the sequence of uniformly distributed random integers. If 
n < p + then £± = 1, otherwise if n < p _ then £± = —1, otherwise £± = 0. 

A Bound on the Required Number of Random Integers. From the 
discussion above we see that for the simulation of each random variable required 
in an LGCA simulation we need one number from the generated sequence of 
uniformly distributed random integers. Hence, per LGCA simulation step, we 
need at most two for the lattice-based particle landing sub-step and at most 
L for the adatom velocity computation sub-step. For a simulation up to real 
time t we perform t/At e g simulation steps and, hence, require in total at most 

— ( L + 2) uniformly distributed random integers. 

4 Selected Simulation Results 

In this section we show some typical simulation results of the deposition of Si 
layers on a flat (i.e. h(r, 0) = 0) and on a structured Si(001) substrate. The ana- 
lysis of the deposition on a flat substrate is presented in [5] and on a structured 
surface is in preparation. The parameters for our LGCA simulation are: lattice 
size L = 500, surface corrugation energy for Si(001) surfaces E c = 1.12 x 10 -19 J 
(~ 0.7 eV, [6]), and unit broken bond energy Ed = 3.68 • 10 -19 J (~ 2.3 eV, [7]). 
We consider growth rate values R g £ {0.1, 10}ML/s and deposition tempera- 
tures T £ {700, 800} K. The number of simulation steps is chosen such that on 
average x = 5 monolayers were grown, i.e. xR g ~ x At e f / _1 steps. In Fig. 2 we 
plot snapshots of the LGCA state at different times for one of the simulations. 
We obtain the expected layer-by-layer growth for the initially flat and also the 
structured substrate. However, on the initially structured substrate we observe 
a considerable higher number of small islands as opposed to only two big islands 
on the initially flat substrate. This is the expected behaviour due to the steps of 
the structured substrate which facilitate aggregation. 

As a first qualitative outcome of all our simulations we state that the average 
height h(k) := (1/ L)J2r=o h(r,k) almost perfectly agrees with the expected 
growth R g kAt e ff of the structure. Another qualitative measure, used to analyse 
the roughness of the evolving thin film surface, is the interface width w{L , k), 

L - 1 

w(L, k) 2 = — ^ [h{r, k) 2 - h{k ) 2 } , 

r — 0 

see [5] for plots. For the initially flat surface it is found that w(L, k) behaves 
independently of the growth temperature for constant growth rate. Furthermore, 
for constant growth temperature, w{L 1 k) approaches a smaller value towards the 
end of the simulation time with increasing growth rate. This behaviour, known 
as kinetic stabilisation , is typical in homoepitaxial deposition. Physically, this 
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Fig. 2. Plots of the height function h(r,k) for an initially flat, subplot (a), and an 
initially structured, subplot (b), substrate. In each subplot, the simulation time k 
corresponds to real times t = 10,20,30,40,50 s (from top to bottom) for a simulation 
with growth temperature T = 700 K and growth rate R g = 0.1 ML/s 



can be explained by the dominance of the uniformly advancing growth front due 
to the deposition flux in comparison to the processes that control the adatom 
diffusion [8]. Further analysis can be performed by considering, e.g. the step 
density, the height-height correlation of the surface, or island sizes and their 
distributions. The step density characterises the roughness of the evolving surface 
morphology by measuring the number of steps. A discussion of this measure, with 
respect to an initially flat substrate, and its dependence on growth rate R g and 
growth temperature T is given in [5]. 

We conclude this section with some remarks on the computational complex- 
ity of the LGCA simulations. The computation time for the simulation of a 
structure of, on average, x monolayers, is directly coupled to the size of the ef- 
fective time step At eS . As can be seen from Fig. 1 (right), At e g decreases, and, 
hence, the computation time increases, with increasing growth temperature T 
and decreasing corrugation energy E c . The most time-consuming sub-steps of 
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the LGCA evolution are the computation of adatom velocities and the motion of 
the adatoms because these sub-steps depend (linearly) on the lattice size L. In 
contrast, the lattice-based particle landing sub-step is independent of the lattice 
size. 

5 Conclusions 

We have described a spatially one-dimensional LGCA model for the simulation 
of surface roughening in homoepitaxial growth. In particular, we gave a detailed 
description of the derivation of the automaton rules based on physical consid- 
eration but also taking computational efficiency into account. We discussed our 
preference of the lattice-based approach for the particle landing sub-step of the 
LGCA as opposed to the site-based approach. The simulations performed pro- 
duced results consistent with experimental observations. The presented model 
should be seen as a starting point for generalisations to higher spatial dimen- 
sions and towards more realistic models by incorporating further details of the 
underlying physical processes. These two directions of future research will clearly 
increase the computational complexity of the LGCA and, hence, from a compu- 
tational point of view, interest should be focused on a parallel implementation 
of the LGCA. 
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Abstract. This paper discusses the application of ACS metaheuristics 
(based on behaviour of real ants: stigmergy and synergetic effect among 
ants) for Job-Shop Scheduling problem (JSP). This algorithm is im- 
proved by introducing the concept of critical events, in which two new 
techniques will be applied. Thus, a more flexible heuristic technique is 
obtained, which improves the performance of ant colony system for JSP. 



1 Introduction 

The job-shop scheduling problem belongs to the scheduling problems that are 
the most difficult ones in classical scheduling theory [13]. Note that a well-known 
problem with 10 jobs and 10 machines formulated in 1963 [20] has been solved 
optimally only in 1989 [7]. Only some special cases with a small number of jobs 
or machines can be solved in polynomial times. 

For the J SP without setup times branch and bound algorithms and heuristics 
have been proposed. We may mention algorithms proposed by Carlier and Pinson 
[7] , Brucker et al. [6] and Applegate and Cook [2] . Among heuristics constructive 
and iterative algorithms have been distinguished. In constructive algorithms pri- 
ority (or dispatching) rules have been applied [3,14,16,22]. A mixed, dispatching 
rule has been applied for short-term scheduling problem in [17]. Authors assign 
different priority rules to the machines. Among iterative algorithms some meta- 
lreuristics have been proposed: simulated annealing, tabu search or threshold 
accepting [18,21]. Also genetic algorithm has been considered as an application 
for the JSP [23]. 

The most obvious approach to solving a scheduling problem is a greedy one: 
whenever a machine becomes available, assign some job to it. A more sophis- 
ticated variant of this approach is to give each job a priority derived from the 
particular optimality criterion, and then, whenever a machine becomes avail- 
able, assign the available job of highest priority to it. The priority of a job can 
be determined without reference to other jobs. 

The ant colony systems (ACS) were applied to solve several problems, e.g. 
the travelling salesman problem, vehicle routing problem, coloring graph, shortest 
common supersequence problem or routing in telephone networks [4]. In this 
paper, we present how different priority rules improve the performance of the 
ACS applied to the Job-Shop Scheduling Problem. 



P.M.A. Sloot, B. Chopard, and A.G. Hoekstra (Eds.): ACRI 2004, LNCS 3305, pp. 296—305, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 




Ant Colony System for JSP 297 



Outline. This paper is organized as follows. The considered problem is de- 
scribed in detail in section 2. In section 3 we demonstrate how the variant of the 
ant colony optimization can be applied to the ACS- JSP. In section 4, we define 
the extended version of ACS for JSP, where we use critical events memory. Fi- 
nally the results of our computational experiments are presented and discussed 
in section 5. We summarize major points and conclusions in section 6. 



2 The Job-Shop Scheduling Problem 

The Job-Shop Scheduling Problem [15] may be formulated as follows. Let M = 
{Mi, . . . , M m } be a set of machines, let J = { Ji . . . , J n } be a set of jobs, and let 
O = {uij}, (■ i,j ) £ I, I Q [1 , n] x [1 , to] be a set of operations, where n denotes 
number of jobs and m denotes number of machines. 

For each operation u t j £ O there is a job Ji it belongs to, a machine Mj on 
which it has to be processed, and a processing time Pij . The set O is decomposed 
into the chains corresponding to the jobs: if the relation Ui P — » Ui q is in a chain, 
then both operations belong to job J,; and there is no Uik such that ut p — > Uik 
or u ik u iq . 

The problem is to find a starting time Sij ( Vw.y £ O) which minimizes: 



max ( ) 

u ij 



subject to 

- > Sik + Pik when -A Uij (the operation precedence 

constraints on the job chains), 

- ( > Skj + Pkj) V ( Skj > Sij + p^) (the machine constraint saying 

that no more than a single job 
can be processed at the same 
time on the same machine). 

The value Cjj = Sjj + Pij is a completion time for the operation Ujj £ O. 
The operations must be assigned to the time intervals in such a way, that 
once an operation is started it must be completed. 

A disjunctive graph 

D = (V, A, E) 

where 

— V — is a set of nodes, 

— A — is a set of conjunctive directed arcs, 

— E — is a set of disjunctive, undirected edges, 

may be associated with an instance of JSP. In such a case V, A and E are defined 
as follows (Fig.l.): 

— V = O U {u 0 } U {uat-i-i}, ({wo} a n d {itAr+i} are the special dummy nodes 
which identify the start and the completion of the overall Job-Shop), 




298 U. Boryczka 



— A = {(uij,Uij. |_i) : Uij Uij + 1 is in the chain for job J^} U {(uo,My) : u\j 
is the hrst operation in the chain for job J;} U {(«j TO , mjv+i) : tt,; m is the last 
operation in the chain for job J,}, for all i, 

- E = {(v,ij,Uhj)}, for all i. 

With each vertex u t j € O, a weight p tJ is associated (po = Pn+i = 0). 




Fig. 1 . Graphic representation of the JSP 3x4 



The length of a path in the graph is defined as the sum of the weights of the 
vertices belonging to the path. Thus, solving the JSP corresponds to finding an 
acyclic orientation of the graph D which minimizes the longest path between 
uq and un+i- In the algorithm solving the JSP, a data structure based on the 
approach used by the List Scheduler algorithms is applied. This algorithms com- 
prise two phases. First, a rule for assigning priorities to the still unscheduled op- 
erations is defined. Then, the operation with the maximum priority is scheduled. 
On each step, the algorithm chooses the operations for which all predecessors 
are already scheduled. As the result the earliest possible starting time with re- 
spect to the precedence and machine constraints is assigned to the operation 
with maximum priority. 



3 Application of the Ant Colony System to the JSP 

The Ant Colony System (ACS) algorithm has been introduced by Dorigo and 
Gambardella [11,12] to improve the performance of Ant System [4], which al- 
lowed to find good solutions within a reasonable time for small size problems 
only. The ACS is based on 3 modifications of Ant System: 

— a different node transition rule, 

— a different pheromone trail updating rule, 

— the use of local and global pheromone updating rules (to favour exploration) . 




Ant Colony System for JSP 299 



The node transition rule is modified to allow explicitly for exploration. An ant 
k in city i chooses the city j to move to following the rule: 

. = f arg max„ gJ fe{[r iu (f)] • [r] iu q < qo 
J l J if q > qo 



where q is a random variable uniformly distributed over [0, 1], qo is a tuneable 
parameter (0 < qo < 1), and J £ Jf is a city that is chosen randomly according 
to a probability 



P k u{t) 



Tu{t) ■ [ rhj \ 0 

E [ru(t)] ■ ImiY 

itj* 



In Ant System all ants are allowed to deposit pheromone after completing 
their tours. By contrast, in the ACS only the ant that generated the best tour 
since the beginning of the trail is allowed to globally update the concentrations 
of pheromone on the branches. The global updating rule is as follows: 



nj(t + n) = (1 - p) ■ Tij{t ) + p ■ ATij(t , t + n) 

where (i,j) is the edge belonging to T + , the best tour since the beginning of 
the trail, p is a parameter governing pheromone decay, and Arij(t,t + n) = jhf, 
where L + is the length of the T + . 

The local update is performed as follows: when, while performing a tour, ant 
k is in city i and selects city j £ jf to move to, the pheromone concentration 
on edge (i, j) is updated by the following formula: 



Tij (t + 1) = (1 - p) ■ Tij(t ) + p ■ T 0 

A JSP described by m machines, n jobs and set of operations O can be 
represented in the context of ACS by a directed weighted graph F = ( U , E) 
where: 

- u = O U {wo}, 

— E = {( UijpUki ) : Uij,uu £ O } U {(uo,itii) : uu is the first operation in the 

chain for job Ji}. 

Node uq is added to specify which job is to be scheduled first, if more than one 
job has its first operation to be performed on the same machine. 

The graph F has N + 1 nodes and + n edges. Each node (except uo) 

represents an operation performed on a machine, and all the nodes are pairwise 
connected in both directions (node u o is connected only to a first operation of 
each job). 

The arcs of F are weighted by a pair ( Tij,rjij ), where Ty represents the 
pheromone trail, and rjij relates to the special priority rule. The weights are 
used to compute the probabilities which tell an ant which operation should be 
chosen in the next step. 

All ants are initially in w 0 and they build a solution by a permutation of the 
remaining nodes. To cope with this problem transition probability is used and 
global and local pheromone updating rules have been performed by ants. 
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Fig. 2. Visualization of a solution made by an ant in the JSP 



4 Critical Events in ACS for JSP 

Our work has investigated the use of “iterative repair” search procedures, which 
start with an infeasible solution and attempt to eliminate conflicts and incorrect 
solutions. In this paper, we propose a new metaheuristic procedure for solving 
job-shop scheduling problems. Our approach is somewhat unconventional. We 
start from a base procedure called ACS for find a critical event in a searching 
process. 

For the JSP examples, we will specify that the critical events of interest 
consist of solutions, that represent a “local suboptimum” i. e. whose objective 
function value is worse than that of the solution immediately before and after it. 
Now we are interested in the effect of restarting the algorithm with a new priority 
rule. It will be happened after the “critical” solution is found. Then the solutions 
that correspond to critical events are the initial solution for the next experiment 
for a special period of time (in our experiments — 50 iterations). To execute 
a restarting procedure, we penalize the worse solutions, implicitly allowing the 
penalty function - pheromone updating rules to decay rapidly these values as 
the number of iterations increases. 

For comparison purposes we included various well-known priority rules from 
job-shop scheduling into our tests, which have been adapted to the considered 
problems. Among the rules considered in our tests we present computational 
results for the following rules: SPT, LPT, SRT, LRT, LRM, ECT, FIFO, NINQ 
[16]- 

There are different reasons of occurrence of critical events in JSP. Once an 
operation has been selected to schedule in this problem, it is necessary to check 
that all the involved constraints are satisfied: 

1. The precedence constraints are defined among operations of the same job: 
it means that the process must guarantee that two operations of a same job 
are not executed in the same period of time. 

2. Checking the consistency of capacity constraints is a difficult process due to 
the next constraints: 

— When an operation is scheduled and a resource is allocated to the oper- 
ation, a forward checking process analyzes the set of remaining possible 
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Fig. 3. An example of different priority rules for mt06 
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Fig. 4. Gantt chart of the schedule before and after FIND-HOLE procedure 



reservations of other operations that requires the same resource, and 
removes those conflicting reservations. 

— We must check if two unscheduled operations that require the same 
resource are not overlapped. Two operations overlap when both require 
the same resource at the same time for every start time. 

— Before indicating an operation has not any possible execution it is neces- 
sary to check the resource usage calendar of the operation. If the shared 
resource is not used during the entire operation’s execution, it can be 
used in another operation. The FIND-HOLE procedure identifies the 
two extreme time points of the temporal line in which the resource is 
available and can be used by another operation. If a hole does not exist, 
an inconsistency will be detected because the resource is not available. 
The FIND-HOLE procedure checks if an apparent conflicting situation 
is actually conflicting. 



5 Examining the Efficiency of the ACS in Job— Shop 
Scheduling Problem 

The experimental work consisted of three major phases. First, an efficient and 
effective parameter settings were determined. Then, ACS -JSP algorithm with 
different priority rules was run several times using this setting to examine the 
behaviour of this algorithm (on the problem ABZ6). Finally the results concern- 
ing the different JSP problems were compared with results obtained by dynamic 
tabu Search and Multi PCP [4]. In order to measure the performance of the 
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□ Average scheduling ■ The best scheduling 




Value of parameters : a andp 



Fig. 5. Value of parameter j3 for 
ACS-JSP 



Fig. 6. Value of parameters: a and 
p for ACS-JSP 



ACS-JSP with two techniques described above, a set of test runs was performed 
using a selection of the classical job-shop problems [1,19,20]. 

The main strength of experiment concerning the first version of Ant System 
(used for JSP) were summarized as follows [10]: 

— With the best parameter values algorithm always finds good solutions (the 
best one about 7% worse from the best-known values. 

— The algorithm quickly finds satysfying solutions, nevertheless it does not 
enter the stagnation behaviour, and the ants continue the search process for 
new, possibly better, solution. 

Our aim is not only to find better results, but also examine the behaviour 
of this algorithm in finding solutions for such difficult problem. It is known that 
the good results obtained by the algorithm depend strongly on the parameter 
setting affecting the computational formulas described in section 3. In the case 
of presented algorithm /?, p and a had to be determined. As the starting point 
for a determination of the role of parameter j3 the following values of the other 
parameters were chosen: q 0 = 0.8, p = a = 0.01, m = 100. The number of 
iterations equals to 200. 
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Table 1 . Makespan results for Job Shop Problems. 



Problem 


Job 

X 

Machine 


Optimal 

value 


Multi 

PCP 


Dynamic 

Tabu 

Search 


ACS mod, 


Error 


LA6 


15x5 


926 


926 


- 


926 


0% 


LA7 


15x5 


890 


890 


- 


915 


2.8% 


LA11 


20x5 


1222 


1222 


- 


1222 


0% 


LA12 


20x5 


1039 


1039 


- 


1047 


0.9% 


LA17 


10x10 


784 


787 


784 


825 


3.9% 


LA23 


15x10 


1032 


1041 


1032 


1089 


5.5% 


LA26 


20x10 


1218 


1272 


1218 


1397 


14.6% 


LA36 


15x15 


1268 


1321 


1268 


1440 


13.5% 



To consider how ant colony flexible affects scheduling performance, the initial 
experimental design was repeated for 20 settings of (3 and three settings for a 
and p. Figures 7 and 8 show the performance results obtained by ACS-JSP for 
the problem ABZ6 with setting of 0.5, 1 to 20 (for (3) and 0.1, 0.01, 0.001 for 
a and p. These parameters occur in global and local updating rules. We adjust 
experimentally the value of parameter /3 observing the histogram Fig. 5. The 
experimental results showed that the value 9 for this parameter is the best one 
(5.4% worse than the optimal schedule c = 1234). Also for examining the value 
of p and a the 10 x 10 ABZ6 was tested with (3 = 9. Both values of parameters 
a and p have the same kind (tendency) of fluctuation. Experimental observation 
has shown that the best value of these constraints equal to 0.01. 

Recapitulating the achieved results, the following set of parameters’ values 
were chosen to perform further tests of critical events in ACS: qo = 0.8, p = a = 
0.01, f3 = 9, to depends on the number of nodes. 

Next experiment was carried out to find the good result for the ABZ6 problem 
(10 x 10) using different priority rules occurred in rj. The investigations were 
performed for the following priority rules: NINQ, SRT, LR.M, LRT. 

In order to discover which priority rule gives the best result (and in what 
sequence) we analysed every of the priority rule mentioned in section 4. We 
cannot point out the best sequence (and its length) unmistakably. All the priority 
rules influence the results in a similar way. We expect the differences between 
the priority rules to appear for the greater and more difficult problems. 

In order to compare the results found by ACS-JSP for different problems, 
we perform the next test (see Tab.l). This table shows the best solutions ob- 
tained by 3 different algorithms: multi PCP [9], Dynamic Tabu Search [8] and 
Ant Colony System with critical events. We chose two algorithms for this com- 
parison, interesting from our point of view. Results obtained by ACS mo d JSP 
unfortunately does not outperform the results obtained by other algorithms, but 
our aim is not to show the best method for JSP. We want to analyse the behavior 
of ants in such a difficult problem. We must underline that it is our preliminary 
research and we hope to cope with this problem. According to the presented ta- 





304 



U. Boryczka 




Fig. 8. Gantt chart of the schedule for ABZ6 



ble, ants can find the best schedule for small problems (LA6, LA11). Therefore 
our investigations will be continued. 

6 Conclusions 

With a rapid increase in creating different hybrid methods we focused on devel- 
opment (or augment) of the ant colony optimisation, which incorporate various 
forms of repairing methods. In our paper two methods for JSP are proposed and 
evaluated. The proposed procedure, which we referred as the „critical event” 
extends the basic algorithm to exploit changeable in time priority rules and re- 
ordering the operations in order to fill the holes. Our research has investigated 
a mataheuristic approach with simple priority dispatching rules. Recently these 
rules are the representative and widely used in different approaches. While the 
priority rules are extremely fast and easy to implement, there are also draw- 
backs. The performance of any given rule is typically quite sensitive to problem 
characteristics, and it is generally quite difficult to find one (or a special sequence 
of them) that dominates in any particular problem, so in the future work we will 
try to find the best sequence (and the duration of exploitation) of priority rules. 
We also try to create more dynamic version of adaptivity to good values of pa- 
rameters of ACS. The more difficult problem is the more dynamic characteristic 
of ACS should be examine. 

The presented algorithm contains elements of several different approaches 
used for JSP and in different metaheurictics (Tabu Search). ACS-JSP is a mod- 
ified version of ACS with two techniques: FIND-HOLE procedure and by using 
different priority rules. The results of our experiments tend to be more attrac- 
tive than those presented in our previous work [5]. ACS-JSP is shown to be 
interesting among modern metaheuristics in term of using it in such difficult 
problem as the JSP is. Future research with more experimental work will aim 
at the motivation for selecting of the ACS for JSP. More generally, the results 
obtained in these experiments illustrate the difficulty of creating an appropriate 
version of ACS for Job-Shop Scheduling problems. 
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Abstract. We used de Bruijn diagrams to analyze traffic models based 
on one-dimensional cellular automata. The first model is the rule 184 
which simulates traffic flow with the simplest dynamics. The second 
model is a rule represented by a cellular automaton of order (4,2), which 
includes all the configurations that can be presented when the model con- 
templates two speeds. To this cellular automata we denominated LCA- 
TRAFFICFLOWVMAX2 which is completely deterministic and we can 
predict or analyze its behavior still before to carry out simulations in 
computer using de Bruijn diagrams. We obtained this model taking as 
it bases the model proposed by Nagel-Schreckenberg to simulate traffic 
flow in one dimension and one direction. 



1 Preliminaries 

The cellular automata were conceived at the beginning of 1950’s by John Von 
Neumann, a mathematical brilliant interested in investigating about the required 
complexity so that a device could be auto-reproductive and the organization 
of this one to have a correct operation when repairing itself with parts that 
could have an operation badly. The results were not let hope, all a current of 
investigation has been developed with a strong interrelation in fields like fractals 
and dynamic systems, as well as applications in: parallel cellular computing, 
simulation of dynamic systems and recognition of patterns, and simulation of 
socio-economical models, for instance. 

In general terms, we can say that a cellular automata is defined and operates 
based on the following elements: 

— States, or attributes, e.g., “on” or “off”, 0 or 1. 

— Containers that store those attributes, e.g., a location in an array, a sector 
in a computer’s memory. 

— Behaviors associated with those states; formulated as conditional transition 
rules (IF, THEN, ELSE) that govern how and when automata react to in- 
formation that is input to them. 
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— Neighborhoods from which the automata draw input, e.g., neighboring loca- 
tions in an array, databases containing information. 

— Temporal domain, through which the evolution of the automata proceeds, 
e.g., real-time, program cycles. 

We can define cellular automata so that it operates in one, two, or more 
dimensions. Nevertheless, the complexity to analyze them or to implement sim- 
ulations of these in a computer is exponentially increased with the number of 
dimensions and states including in its definition. 

The mentioned aspect of complexity previously, has oriented some inves- 
tigators to work with cellular automata that operate in one dimension, since 
this allows them to analyze with more detail the referring aspects to its behav- 
ior. One of the pioneering works about one-dimensional cellular automata made 
Stephen Wolfram [12] during the decade of 1980’s. Professional scientific interest 
in cellular automata received a considerable impetus from the investigations of 
Stephen Wolfram, who undertook a computer based search through the proper- 
ties of one-dimensional automata, guided by some concepts from the realm of 
nonlinear dynamics and statistical mechanics. 

So, according to the Wolfram notation, we can define a one-dimensional cel- 
lular automaton of the following way 

x l +1 = f{ x \- r i : K 2r+1 H> K, (1) 

where r is the neighborhood radius, if is a finite set of states, i £ 7Z , x is 
the automaton and x £ K. The equation (1) defines the evolution rule (p for a 
cellular automata of order (fc,r), where k is the number of states and k £ K. 

From the definition of a one-dimensional cellular automaton we can charac- 
terize its behavior identifying ancestors, finding cycles, search for the garden of 
eden - configurations which cannot evolve from any predecessor - or any other 
pattern of behavior that is of our interest. 

A very useful tool which we can use to characterize the behavior of cellular 
automata is the de Bruijn diagram [9], main tool of shift register theory [1]. Be- 
cause the cells of one dimensional automata, by denition, form linear chains and 
the neighborhoods themselves form another kind of chain, wherein they neces- 
sarily overlap. The result is an arrangement which has a very close connection 
with shift register theory, which is a systematic study of the properties of over- 
lapping strings and how the overlaps happen to come about. In particular, the 
form of graphical representation known as the de Bruijn diagram. 

Several papers have used de Bruijn diagrams for representing and analysing 
the evolution rule of a one-dimensional cellular automata. Masakazu Nasu [8] 
refers the properties of injective and surjective evolutionary functions to de 
Bruijn and related diagrams; Wolfram [13] himself used them to express evo- 
lutionary properties, and Erica Jen [2] has used them to calculate ancestors. 
The de Bruijn diagrams are defined as follows: 

(i) The nodes of the diagram are all the sequences of 2 r cells, i.e. the set K 2r . 
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(ii) Let a,b be states of K and £ 1 , £ 2 sequences of K 2r ~ 1 . Then a£i y ^b are 
sequences of K 2r representing nodes of the de Bruijn diagram. There is an 
arc from a£i to £ 2 b if £1 = £ 2 , and it represents the sequence a£i b G K 2r+1 
which is a complete neighborhood of the automaton. 

(iii) Every arc labelled by the state in which the neighborhood a£i b evolves 
according to the evolution rule <p. 



In this way, paths in the de Bruijn diagram are sequences of symbols formed 
in one-dimensional cellular automata. So, the equation that defines a de Bruijn 
diagram is 

f (ki 

ki + 1 



BM id = { 



1 3 = 



( mod k 2r ) 

ki + k — 1 



(2) 



[ 0 otherwise. 



Another diagram which we can construct from the de Bruijn diagram is the 
subset diagram. The subset diagrams was introduced by Edward F. Moore in 
1956 [5]. In the subset diagram the nodes are grouped into subsets, note being 
taken of the subsets to which one can arrive through systematic departures from 
all the nodes in any given subset. The result is a new graph, with subsets for 
nodes and links summarizing all the places that one can get to from all the 
different combinations of starting points. Sometimes, but far from always, the 
possible destinations narrow down as one goes along; in any event one has all the 
possibilities cataloged. One point to be observed is that if one thinks that there 
should be a link at a certain node and there is not, the link should be drawn 
to the empty set instead; a convention which assures every label of having a 
representation at every node in the subset diagram. 

Let a and b be nodes in a given diagram, S a subset, and |<Sj the cardinality 
of 5; the formal definition of its subset diagram is 

f 0 S = Q) 

Si(S) = l {b\linki(a, b)} S = {a} (3) 

I'S'I > 1- 

There is another important reason for working with subsets. Labelled links 
resemble functions, by associating things with one another. But if two links 
with the same label emerge from a single vertex, they can hardly represent a 
function. Forging the subset of all destinations, leaves one single link between 
subsets, bringing functionality to the subset diagram even though it did not exist 
originally. Including the null set ensures that every point has an image, avoiding 
partially defined functions. 
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2 Rule 184 

The model that represents traffic flow with the simplest dynamics is that in which 
the cars move of left to right by an “artificial highway” . This artificial highway 
consist of an array of L sites or cells. Each site or cell is either occupied or empty. 
In this model we assume periodic boundary conditions. Time is discrete. The 
evolution is synchronous, that is, all cars move at same time. At each time step, 
each driver moves his car to the right only if the next cell is empty. Therefore, 
car at site i can either move to site i + 1 this site is empty, or not move if site 
i + 1 is occupied. Thus, the state of a given cell i depends only on cells i — 1, i 
and i + 1. This model is equivalent to cellular automaton Rule 184 if the state 
of an occupied site is 1, whereas the state of an empty site is 0. The Rule table 
is 

neighborhood 000 001 010 Oil 100 101 110 111 
image 00011101 

In the Fig. 1 we showed the generic de Bruijn diagram for the Rule 184, which 
is a (2,1) automaton (one having two states per cell with a single neighbor on 
each side, according to Stephen Wolfram’s notation) has four nodes correspond- 
ing to the four two-cell partial neighborhood, with eight links representing the 
full three-cell neighborhoods. The still life diagram may be extracted from the 
generic diagram by removing all links which do not conserve their central cell 
during the automaton’s evolution. Loops in the subdiagram composed of the 
remain links define sequences, rather than individual cells, which are invariant 
to evolution. By reading of such sequences, all the still lifes for the automaton 
are automatically determined. 

The topological matrix corresponding to the diagram shown in Fig. 1 is 

0 0 .. 

..01 
11.. 

..01 

Also we can construct subset diagram from de Bruijn diagram. Let A = 00, 
B = 01, C = 10, D = 11. The subset diagram then has 2 4 = 16 nodes for the 
sixteen subsets, but to save space we show only the part connected to the empty 
subset. The destination of each arrow leaving individual nodes is 

Node 0 leads to 1 leads to 
A A3 0 

B C D 

C 0 A,B 

DC D 

Then, form vectors whose elements are the tail nodes and whose components 
are indexed by the head nodes; an element is null if there is no incoming link 
from any node. 
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State 0 
State 1 |B| 




Fig. 1 . De bruijn diagrams generated by the Rule 184. (A) The generic de Bruijn 
diagram. (B) Subdiagrams generated by Rule 184 still lifes. 



Node 


0 leads to 


1 leads to 


{ABCD} (A, A, B+D, 0) 


(C, C, 0) 


{ABC} 


(A, A, B, 0) 


(C, C, 0, B) 


{ABD} 


(A, A, B, 0) 


(0, 0, 0, B+D) 


M 


( 0 , 0 , D, 0) 


( 0 , 0 , 0 , D) 


{C} 


(0, 0, 0, 0) 


(C, C, 0, 0) 


{AB} 


(A, A, B, 0) 


( 0 , 0 , 0 , B) 


m 


(0, 0, 0, 0) 


(0, 0, 0, 0) 



This structure is actually the vector subset diagram; the scalar subset di- 
agram arises by enumerating the non-null indices. We don’t care how many 
incoming arrows there are nor where they originate, as long has there is at least 
one 



Node 0 leads to 1 leads to 



{ABCD} 


{ABC} 


{ABD} 


{ABC} 


{ABC} 


{ABD} 


{ABD} 


{ABC} 


M 


{D} 


{C} 


M 


{C} 


{0} 


{AB} 


{AB} 


{ABC} 


M 


{0} 


{0000} 


{0000} 



its topological matrix, having indexed the subsets in the order shown, is S = a+b 
according to 
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l ABCD I 




Fig. 2. Subset diagrams for Rule 184. (A) Entire subset Diagram. (B) Subset diagram 
that shows only the part connected to the empty set. 
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Fig. 3. Space-time diagram that shows the evolution of the Rule 184. 

Through subset diagram we can know if cellular automaton it has injective 
properties finding gardens of eden. In the case of Rule 184 we can observe in the 
Fig. 2 that 01100 chain or any permutation of this one cannot be generated by 
any predecessor. 

In the Fig. 3 we can observe through the evolution of the Rule 184 that the 
traffic jams move a site to the left in each time step. 



3 LCATRAFFICFLOWVMAX2 

This model is based on the Nagel-Sclrreckenberg traffic model [7]. However, LCA- 
TRAFFICFLOWVMAX2 is completely deterministic. The road is divided on 
one-dimensional array of L with periodic boundary conditions. Each site is ei- 
ther ocuppied by one car or empty. Each car has an integer velocity with values 
between zero and v max . The number of empty sites in front of a car is denoted 




312 R.R. Zamora and S.V. Chapa Vergara 



by gap. For an arbitrary configuration, one update of the system consist of the 
following three consecutive steps, which are applied in parallel for all cars: 

1. If ( v < gap — 1) then v := v + 1. 

2. If ( v > gap + 1) then v := gap. 

3. Car motion: Each car is advanced v sites. 

We represented this model with a cellular automaton of order (4,2), where 
r = 2, K = {0, 1,2,3}, and u max = 2. The four states have the following 
representation: state 0 —1 empty site, state 1 —> ocuppied site with v = 0, 
state 2 ocuppied site with v = 1, state 3 — > ocuppied site with v = 2. The 
Rule table is 

neighborhood 00000 00001 00002 00003 00010 00011 00012 00013 .... 33333 
image 00000000 .... 1 

The de Bruijn diagram of LCATR AFFICFLOWVMAX2 contains 4 2 ( 2 ) = 256 
nodes and 4 2 ^ 2 f +1 edges which represent the 1024 neighborhoods. In the Fig. 4 we 
represented the de Bruijn diagram like a tree, where the partial neighborhoods 
have a decimal representation. These neighborhood are ordered of ascending 
form according to the number of states and to the possible combinations. With 
this type of representation it is easier to identify the nodes that form cycles 
of periodic chains during the evolution of cellular automaton since it is only 
necessary to take like beginning from the cycle some node from the tree and 
to follow his ramifications until finding it again. For example, in the Fig. 4 we 
can see the cycle formed by the nodes 0 — > 2 — > 8 — >35 — > 142 — > 56 — > 
226 — > 136 — > 32 — > 128 —> 0. Although we did not show the subset diagram of 
LCATRAFFICFLOWVMAX2 because its size is too great to this paper -2 256 
nodes- we can say that the chains of l’s form gardens of eden, or in other words, 
traffic jams. 

In the Fig. 5 we showed cycles that form chains in which the cells have a shift 
of 5 sites to the right in 3 generations during the LCATRAFFICFLOWVMAX2 
evolution’s. 

Finally, in the Fig. 6 we showed the LCATRAFFICFLOWVMAX2 evolution. 



4 Concluding Remarks 

The behavior of traffic flow represents a difficult problem as far as the analysis 
that is necessary to make to characterize it. Many works exists in this sense ori- 
ented towards different points with the purpose of including the greater number 
of aspects involved in this complex system. The de Bruijn and subsets diagrams 
are useful tools to analyze traffic flow models because they contain all the in- 
formation referring to the behavior and the characteristics of any linear cellular 
automata. Thus, this kind of tools allow us to know with certainty as the different 
behaviors that can have the traffic flow models are generated. 
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Fig. 4. De Bruijn diagram generated by LCATRAFFICFLOWVMAX2. 
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Fig. 5. Cycles that form chains in which the cells have a shift of 5 sites to the right in 
3 generations. 




Fig. 6. LCATRAFFICFLOWVMAX2 evolution’s. 
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Abstract. The proliferation of mobile computing devices in recent years has 
produced an increased interest in the development of efficient methods of 
communication among these devices. As networked mobile devices move, the 
configurations of the networks of which they are a part change and if the time 
between these changes is short enough then conventional routing algorithms are 
no longer useful. This paper describes the use of cellular automata to model 
message broadcasting among autonomous agents in highly mobile ad hoc net- 
works. The advantages offered by using this method to study the problem are 
discussed and an upper bound on the time steps required by a “reasonable” 
broadcast algorithm is derived. 



1 Introduction 

1.1 The Problem of Message Passing in Highly Mobile Ad Hoc Networks 

When networked mobile devices move, the configurations of the networks of which 
they are a part change. As a result, the best path along which to pass a message from 
one node in the network to another may change over time. If the changes in topology 
occur at reasonably well spaced intervals conventional algorithms for determining 
shortest paths in graphs can be used. But in a more rapidly changing system these 
methods work less well, if at all. 

In this paper we consider networks comprising many nodes which operate inde- 
pendently of each other with no centralized control or global knowledge of the state 
of the network and in which the topology changes too frequently for conventional 
routing methods to work. We consider a number of autonomous agents deployed over 
an area large enough that direct communication between all agents may be impossible 
at any given time. No individual agent is aware of the location of any other of the 
agents except those within its communication range. 

One agent may be in range of a second agent which, in turn, is in range of - or may 
move to come in range of - a third agent not in range of the first. The first can send a 
message to the third by sending it to the second which relays it on. As a starting point 
for studying techniques for message passing in these networks, we look at broadcast- 
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ing a single message from one node to all other nodes by passing the message from 
any node holding it to any other nodes which come within its communication range. 
This paper explores the use of cellular automata to model the agents, the messages, 
and the space in which they move. 



1.2 Why Use Cellular Automata to Study This Problem? 

Cellular automata lend themselves well to problems which are characterized by ele- 
ments which assume one of several discrete states and in which the state of an ele- 
ment at any given time depends in some way upon the states of its neighbors at the 
previous point in time. The use of time steps works well in defining rapidly changing 
systems because it provides a means of expressing the system state at discrete times. 

States can be defined to describe cells with various numbers of agents which, in 
turn, hold or do not hold the message. By selecting a reasonably descriptive subset of 
all possible agent/message combinations, the number of states considered can be kept 
relatively low while at the same time providing a useful description of the system. 

Additionally, while a graphic interface is certainly not a required part of a cellular 
automaton, the nature of the cells - that is, that at any given time, each cell is in one 
of a finite number of well-defined states - makes graphic display of the system 
straightforward. Displaying the system state graphically at each time step provides an 
overview of the activity of the agents and messages which make their movement pat- 
terns much more readily observable than they might otherwise be 



2 Literature Survey 

2.1 Communication in Mobile Ad Hoc Networks 

The primary issue in transmitting messages in any network is finding a reasonably ef- 
ficient path from one node to another. This may mean simply finding the shortest path 
between the nodes but it often involves consideration of other factors which may ef- 
fect the time taken to pass the message. When the nodes are mobile, the problem is 
greater because the connectivity of the networks may change significantly between 
the time when a path is calculated and when it is used [1], One way of addressing this 
is to maintain a routing table at each node listing the nodes it can reach [2], This table 
is updated periodically and shared with its neighbors. 

In [3] the efficiency of maintaining such tables proactively is compared to that of 
building them as needed and, in so doing, the overhead of maintaining tables with 
links to nodes which are no longer in communication is saved. They also use the past 
history of routes in the ad hoc network to make predictions of the stability of paths 
and attempt to use more stable paths. A method of taking “snapshots” of a distributed 
system showing its state at a given point in time is described in [4] and in [5] this 
technique is adapted for use in delivering messages in an ad hoc network. 

Some proposed techniques assign specific tasks to certain nodes [6] . Others assume 
that the network is always connected and that changes in topology are localized and 
use this trait to cluster nodes using weakly connected dominating sets of nodes [7], 
These approaches, however, present a model different from that being considered in 
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this paper. In particular, the first requires that the agents are homogenous and the sec- 
ond makes assumptions about connectivity that our model does not. 

The focus of this paper is to study the ways in which a message moves as it is be- 
ing broadcast with the goal of discovering ways of fostering this activity while not 
compromising the autonomy of individual agents. One problem that arises is that the 
message may be transmitted repeatedly between a given pair of nodes. An interesting 
method for reducing this transmission redundancy is presented in [8]. It is suggested 
that delaying the relaying of the message until an agent has had a chance to move out 
of the area in which it received it can reduce unneeded transmissions. 



2.2 Autonomous Agents 

This paper discusses the representation of autonomous agents moving in some defined 
space. A message is broadcast from one of the agents to all of the others. Cellular 
automata are used to represent the space in which these agents exist and the agents 
themselves - along with the message if they have it - are represented as states. The 
idea of the network’s nodes being autonomous is an important one because it suggests 
that each node is free to operate according to some agendum under its control. We are 
interested in observing the patterns of dissemination that may emerge from letting the 
nodes (agents) operate in this manner and, ultimately using that to advantage. A dis- 
cussion of adaptability and self-organization of agents is given in [9] where it is said 
that autonomy is a necessary and sufficient condition for these to exist. Agent mod- 
eling is discussed in more detail in [10] and several possible attributes of agents are 
listed. 



2.3 Cellular Automata 

A cellular automaton is an entity which can be in one of a finite number of well- 
defined states. The state of the automaton at a given point in time is determined by a 
set of rules based on its state and the states of automata adjacent to it in the automata 
space at the immediately preceding point in time. The updating of automata states in a 
space can be done synchronously, in which case the states are changed in some order 
and the “new” state of a given automaton may depend on the states of some neighbors 
in the previous time step and the states of others in the current step. Alternatively, up- 
dating can be done asynchronously, or in parallel, so that changes of state in neigh- 
boring cells do not affect a cell until the following time step. A history of cellular 
automata and descriptions of several uses for them are given in [11] with further dis- 
cussion in [12]. Applying cellular automata to the problem of location management in 
mobile networks is discussed in [13]. 



3 Simulating Mobile Networks with Cellular Automata 

An environment with a mobile network comprises several elements which must be 
represented by any simulation designed to study message passing in such networks. 
These elements are the space in which the agents (nodes) exist and move, the mes- 
sages being passed among agents, and the agents themselves. 
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Also important is the time required to transmit a message from one agent to an- 
other, as is the speed at which the agents can move. The simulator used for this work 
represents time as discrete steps. The number of steps required for an agent to move 
from one cell to another and the number of steps needed to transmit the message can 
be set independently of each other. 



3.1 Description of Cells and the Space 

The simulation is based on cells that can be in any of several states. The cells used in 
this simulation are square and the space they occupy is rectangular. Three general cell 
states are defined as empty, containing one or more agents without the message, and 
containing one or more agents with the message. 

Whenever an agent with the message comes within range of an agent that does not 
have the message, it passes the message on. Accordingly, if more than one agent oc- 
cupies a given cell, either they all have the message or none of them does. The total 
number of possible states for a cell, is 2a + 1 , where a is the number of agents in the 
system. A given cell can be empty; it can contain up to a agents, none of which hold 
the message; or it can contain up to a agents, all of which hold the message. 



3.2 Definition and Movement of Agents 

The number of agents is specified at the beginning of the simulation and is fixed for 
the entire run. Movement of an agent from one cell to another is modeled by the cell 
of origin changing state to reflect that the agent is no longer in it and the destination 
cell changing state to reflect its presence. Since the number of agents in the system is 
fixed for any given simulation run, if a cell’s state is changed to indicate that an agent 
has moved into it, some other cell’s state must change to reflect the movement out. 

An agent can move only between adjacent cells. That is, a cell’s state can change 
to reflect the presence of an agent in a given time step only if at least one of the cell’s 
neighbors contained an agent in the previous time step. Consequently, the definition 
of “neighborhood” is important. The neighborhood of a cell is simply the set of cells 
defined to be adjacent to it. Two commonly used definitions of adjacency are the von 
Neumann and Moore neighborhoods. These are illustrated in Figure 1 . A Rotated von 
Neumann neighborhood is also considered in this study. Each of these neighborhoods 
can be defined either to include or to exclude the cell itself. 

One of the defining characteristics of agents is their autonomy. This means that we 
cannot dictate the detailed movement of agents in the system. Accordingly, there is an 
element of randomness involved in determining an agent’s next location (The ran- 
domness is implemented using the C++ rand function.) When a simulation is initial- 
ized, the dimensions (rows and columns) of the space are defined and the number of 
agents in the system is set with most of the agents being distributed randomly 
throughout the cells. An exception to this random placement is made in order to as- 
sure that the agent originating the message is in a corner of the space and that at least 
one other agent is in the diagonally opposite corner. 
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(a) (b) (c) 

Fig. 1. Neighborhoods: (a) Von Neumann, (b) Moore, (c) Rotated von Neumann 

3.3 Passing of Messages 

A corner cell is designated to contain an agent which is the message originator and the 
message is passed from one agent to another whenever an agent holding the message 
comes within range of an agent not holding it. This is modeled by changing the state 
of the cell of the agent receiving the message from a cell with one or more agents 
without the message to a cell with one or more agents with the message. 



3.4 Time Steps and Neighborhoods 

For this paper, agents could move to a neighboring cell and the message could be 
passed between neighbors in each time step. The types of neighborhood used for each 
of these need not be the same and can be used to create a more realistic model; by de- 
fining message passing in a Moore neighborhood and agent movement in a von Neu- 
mann neighborhood, the agents are restricted to moving around corners while the 
message can be passed more efficiently. 



4 Studies of Message Passing in Highly Mobile Ad Hoc Networks 

There are many factors to be considered when studying message passing in highly 
mobile ad hoc networks. Among these are the patterns of movement of the agents in 
the network, the rules governing message passing between agents, and the resulting 
patterns of movement of the messages being passed. Once these factors are well un- 
derstood, ways of improving the efficiency of the process may be examined. This pa- 
per reports results from a series of cellular automata-based experiments designed to 
establish a practical upper bound on the time required to broadcast a message and 
thereby serve as a starting point for further investigation. 



4.1 Network Simulations 

The experiments reported on here are based on a scenario in which many agents can 
move independently throughout an area. A single agent is initially given a message to 
be broadcast to the other agents. Cells are updated asynchronously and the simulation 
is stopped when all agents have received the message. 

A set of simulations was run in a 25 x 25 space with different agent densities and 
the average number of time steps taken to complete broadcast in each case was re- 
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corded. The agent initially holding the message is in one corner and there is an agent 
in the diagonally opposite comer. Otherwise, the agents are randomly distributed 
throughout the space. Opposing corner cells represent the worst case in terms of dis- 
tance from other cells and the worst case for time required to complete the broadcast. 

As described in Section 3, the number of time steps needed for an agent to move 
from one cell to another and the number of steps taken to transmit a message between 
agents can be set independently. For the simulations described in this paper, each of 
these numbers is set to 1. Increasing the time needed to move relative to message 
passing produces results similar to those reported here except that, predictably, more 
total time is required. 

Simulations were run using two slightly different rules for agent movement. In one, 
the agents were required to move to a neighboring cell at each time step. In the other, 
they were allowed to remain at their current position. In each case, an agent selects a 
destination cell by choosing from among its neighbors (perhaps including the cell it 
currently occupies) with equal probability. Whenever an agent with the message 
comes within range of an agent without the message, the first agent passes the mes- 
sage to the second. Simulations were run for several combinations of neighborhood 
types. 

The results of runs in which agents are allowed to stay in place - that is, when a 
cell is considered to be part of its own neighborhood - are shown in Figure 2 and are 
analyzed in the next section. When agents were required to move, the time needed to 
complete broadcast was somewhat less (for example, 20 to 30 time steps at a density 
of 10 percent). At higher densities, the times are nearly the same. Note that requiring 
agents to move is a special case of allowing them to remain in place and the resulting 
times would be expected to fall between the lower bound time (discussed in Section 
4.2. 1) and the time needed when they are not required to move. 



4.2 Analysis of Simulation Data 

In this section the data collected above are discussed in light of existing theory con- 
cerning the time required to broadcast in grids. This is then used to establish lower 
and upper bounds on the broadcast time in order to define the limitations and the po- 
tential for gain in broadcast techniques. The simulations above allowed agents to 
move randomly throughout the space and to pass the message whenever possible. By 
placing restrictions on the movement or message passing, gains in time may be 
achievable within the bounds defined here. The potential for these gains is described 
as future work in Section 5 

4.2.1 Lower Bounds for Broadcast Time in a Grid 

To analyze the data from the simulations in Section 4.1, it is helpful to consider some 
underlying theory concerning message passing in fixed networks. In a fixed rectan- 
gular network (grid) the worst broadcast time occurs when then message originates in 
a corner. For a square grid with n nodes on a side this time is 2 n - 2 steps, as proven 
by Farley’s Theorem [14]. 

When the nodes are allowed to move freely, agents in diagonally opposite corners 
can move toward each other, cutting this time in approximately half. If we mix neigh- 
borhood types, the delivery time can be further improved. If the agents move within 




322 



M. Kirkpatrick and F. Van Scoy 



220 
200 
180 
160 
140 

(/) 

B 120 
c / ) 

| 100 
i- 

80 
60 
40 
20 
0 

10 20 30 40 50 60 70 80 90 100 

Density 

Fig. 2. Time steps to complete broadcast as a function of density for different 
move/pass neighborhood combinations. In the key, V = von Neumann, M = Moore, and R = 
rotated von Neumann. The first and second letters refer to the movement and message passing 
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von Neumann neighborhoods, but pass messages within Moore neighborhoods, the 
best-case time can be reduced to n - 2. This scenario is reasonable since the networks 
being modeled comprise physical agents communicating by radio or some similar 
medium. A physical agent must move around buildings, walls, or rocks but the radio 
waves they use to communicate can go through these obstacles. 

Finally, if agents move and messages are passed in Moore neighborhoods, the time 
can be reduced to ceil(n/2) - 1. Using these numbers we can establish a lower bound 
for the number of time steps required to complete broadcasting in a space with given 
dimensions and neighborhood definitions. When comparing broadcasting techniques, 
this can be used as a way of measuring their relative efficiency. 

4.2.2 The Effect of Neighborhood Choice on Broadcast Time 

It is immediately apparent from Figure 2 that, for all choices of movement/passing 
neighborhoods, the higher the density, the sooner broadcast is complete. It can also be 
seen that the number of time steps drops dramatically as density is increased to 
around 40%. Beyond that, the improvement is slight. 

The graph also shows the difference in performance resulting from various neigh- 
borhood choices. Combinations which include a Moore neighborhood complete 
broadcasting more quickly than corresponding combinations which do not because a 
Moore neighborhood allows agents and/or messages to move along the diagonal of 
the space. It also allows movement in more different directions than either von Neu- 
mann or rotated von Neumann neighborhoods, however, decreasing the probability 
that the message will be disseminated in the most efficient direction. 

It is also apparent that the overall most efficient neighborhood choice combines a 
rotated von Neumann neighborhood for agent movement with a Moore neighborhood 
for message passing. So the rate of information dissemination depends not only on the 
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relationship of one cell to another but also on the rate at which the message moves 
away from its point of origin. Movement in a rotated von Neumann neighborhood 
provides the best chance of moving the furthest from the origin most quickly. 

4.2.3 Neighborhoods That Can Cause Broadcast to Fail 

As noted in Section 4.2.1, when the von Neumann/Rotated von Neumann and Ro- 
tated/Rotated neighborhood combinations are used, broadcast is likely to fail to com- 
plete. This is because the two neighborhoods are mutually exclusive. They form, in 
effect, a checkerboard and the rules for moving among them prevent moving from 
black to red squares. Agents move, then, on two separated non-overlapping grids, su- 
perimposed one over the other. 



4.3 Summary of Simulation Results and Comparison with Other Work 

The simulation data analyzed here enable us to determine an upper bound for “rea- 
sonably efficient” broadcast algorithms for use in highly mobile ad hoc networks. 
Farley previously proved the lower bound for such algorithms in fixed grid networks. 
His results are adapted here to ad hoc mobile networks and serve to define that bound. 

This paper presents data showing average results for a number of runs under dif- 
ferent circumstances. Since these results were obtained without defining any particu- 
lar movement pattern beyond random selection of a neighboring cell, it is not reason- 
able to expect that they are optimal. When the density exceeds approximately forty 
percent, however, the number of time steps needed using this technique differs by 
nearly fixed amounts from the theoretical best achievable times. This gives us a loose 
upper bound on what can be considered acceptable. 

These results can be used to gauge the efficiency in terms of time steps needed by 
different algorithms to complete broadcast. The lower bound described is fixed and 
the closer one can come to it, the better. If that bound is not achieved, the times pre- 
sented here define an upper bound on what could reasonably be considered to be 
“close enough.” 

These results are consistent with work presented in [15] in which information dis- 
semination in mobile ad hoc networks is compared to the spreading of epidemic in- 
fections and an analytical model is presented for calculating the time required to reach 
all nodes under various rates of "infection." They demonstrate that, for a given “diffu- 
sion force,” as more time passes, the percentage of infected nodes (in other words, 
agents with the message) increases dramatically to a point and then levels off. This 
agrees with the results presented Figure 2. In this case the diffusion force is related to 
the agent density 



5 Conclusions and Suggestions for Future Work 

Farley’s Theorem concerning message broadcast time in fixed grid networks can be 
adapted to determine the best achievable time in mobile ad hoc networks. This paper 
shows how to do that and then presents a means for establishing a “reasonable” upper 
bound. 

This paper describes the use of cellular automata for the purpose of analyzing mes- 
sage broadcast time in highly mobile ad hoc networks. This work is intended to serve 
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as a starting point for further research. The simulation method used appears promising 
as a means of testing a variety of broadcast algorithms. The next step in this process is 
to study the dissemination patterns produced as agents and messages move throughout 
the modeled space. Once those patterns are well understood, algorithms can be devel- 
oped which may improve the broadcast time. 
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Abstract. Understanding the collective behaviour of packets traffic in 
data networks is of vital importance for their future developments. Some 
aspects of this behaviour can be captured and investigated by simpli- 
fied models of data networks. We present our CA like model describing 
processes taking place at the Network Layer of the OSI (Open Systems 
Interconnection) Reference Model. We use this model to study spatio- 
temporal packets traffic dynamics for various routing algorithms and traf- 
fic. loads presented to the network. We investigate how additional links, 
added to a network connection topology, affect the collective behaviour of 
packets traffic. We discuss how some of the network performance aggre- 
gate measures reflect and relate to the emergence of packets collective 
spatio-temporal dynamics. We present selected simulation results and 
analyse them. 



1 Introduction 

In recent years data communication networks have experienced unprecedented 
growth that will continue in the future. The dominant technology is the Packet 
Switching Network (PSN). Examples of PSNs include the Internet, wide area net- 
works (WANs), local area networks (LANs), wireless communication networks, 
ad-hoc networks, or sensor networks. The widespread proliferation of PSNs has 
outpaced our understanding of the dynamics of packets traffics and their de- 
pendence on network connection topologies, routing algorithms, amounts and 
types of data presented to the networks. For the future development of PSNs 
it is important to understand what effects may have local changes in network 
connection topologies, changes in operation of single routers, or changes in rout- 
ing algorithms on the collective behaviour of packets. The further evolution of 
PSNs and improvements in their design depend strongly on the ability to pre- 
dict networks performance using analytical and simulation methods, see e.g. [1, 
2,3,4] and the references therein. The need to have a simulation model closer 
to the real packet switching networks than static flow and queueing models in 
order to study which network features and parameters affect flow, congestion 
and collective spatio-temporal dynamics of packets in PSNs has motivated our 
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research. Our simulation model describes the processes taking place at the Net- 
work Layer of the OSI Reference Model. For this layer we have constructed a 
cellular automaton (CA) like model [5,6]. In order to simulate our CA model of 
PSNs we developed a C++ simulation tool, called Netzwerk-1 [7]. The benefits 
of our model are that, like real networks: it is concerned primarily with packets 
and their routing; it is scalable, distributed in space, and time discrete. More- 
over, it avoids the overhead of protocol details present in simulators designed 
with different goals in mind. 

We have applied our model and its further refinements to study, among oth- 
ers, the effects of routing algorithms and network topologies connectivity, size 
and the addition of extra links with various preferentiality factors of attach- 
ment on the phase transition from free flow to congestion, throughput, average 
packet delay, packet average path length and speed of delivery, see [8,9,10,11]. 
The presented work is a continuation of our previous investigations. Its goal is 
to study how the packets collective spatio-temporal dynamics is affected by the 
network topology. Also, our aim is to isolate aspects of routing algorithms that 
lead to spatio-temporal self-organisation in packets traffics. We explore, for se- 
lected network performance aggregate measures, how they reflect and relate to 
the emergence of packets collective spatio-temporal dynamics. 

The paper is organised as follows. In Section 2 we describe our CA network 
model. In Section 3 we present selected simulation results and analyse them. In 
Section 4 we provide our conclusions and outline future work. 



2 Packet Switching Network Model 

There is vast engineering literature devoted to PSNs, see [3] and the references 
therein. We limit our discussion to the development of our PSN model. Its de- 
tailed construction is described in [6,7]. The purpose of a PSN is to transmit 
messages from sender to receiver. In a real-world PSN, messages are broken into 
several equally sized pieces called packets. These packets are then sent across a 
network consisting of connected nodes, known as hosts and/or routers. In our 
model we restrict all messages to the length of one packet. Packets contain an 
information payload along with routing and other header information. Since our 
concern is strictly with the delivery of packets across the network, the informa- 
tion payload contained within each packet is irrelevant to our discussion. Hence, 
in the considered models we assume that each packet carries only the following 
information: time of creation, destination address, and number of hops taken. 

As in a real-world PSN, our network model consists of a number of intercon- 
nected switching nodes. Each node can perform the functions associated with 
a host and a router. The primary functions of the hosts are to generate and 
receive packets. Routers are responsible for storing and forwarding packets on 
their route from source to destination. Packets are created randomly and inde- 
pendently at each node. The probability A with which packets are created is 
called source load. Nodes maintain incoming and outgoing queues to store pack- 
ets. For this paper we consider one incoming queue and one outgoing queue per 
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switching node. We assume that the outgoing queues are of unlimited length 
and operate in a first-in, first-out manner. Each node at each time step routes 
the packet from the head of its outgoing queue to the next node on its route 
independently from the other nodes. A discrete time, synchronous and spatially 
distributed network algorithm implements the creation and routing of packets [6, 
7,8]. The considered network connection topologies, routing cost metrics and the 
PSN model algorithm are described in subsections that follow. 

Network Connection Topology. In a real-world PSN, while the physical 
means of communication links may vary, the underlying connection topology 
can always be viewed as a weighted directed multigraph C. Each switching node 
corresponds to a vertex and each communication link is analogous to a pair of 
parallel edges oriented in opposite directions. We associate a packet transmission 
cost to each directed edge, thus parallel edges do not necessarily share the same 
cost. 

For the presented set of simulations we consider network connection topolo- 
gies C = Ci,a (L) that are isomorphic to two-dimensional periodic or non-periodic 
square lattices with the number of nodes in the horizontal and vertical directions 
L = 16. We study both unperturbed networks with l = 0 additional links and 
perturbed ones by adding randomly l > 0 extra links. The first node to which an 
additional link attaches is selected with uniform probability from the set of all 
nodes in the network. The second node is selected randomly from the remaining 
nodes with bias towards the nodes with higher number of outgoing edges. The 
bias is called preferentiality factor ( PF ) of attachment. If the bias is equal to 
zero then the second node is selected with uniform probability. All randomly 
generated links are added prior to a simulation run. All links in the network are 
static for the duration of the simulation. 

Static Routing and Adaptive Routing. In the network models, packets 
are transmitted from their source nodes to their respective destination nodes. 
The path across the network taken by each packet is determined by the rout- 
ing decisions made independently at each node based on a least-cost criterion. 
Depending on the costs assigned to each edge of the graph, we consider routing 
decisions based on the minimum route distance or the minimum route length 
least-cost criterion [2,3]. We consider three types of edge cost functions called 
One (ONE), QueueSize (QS), and QueueSizePlusOne (QSPO) [6,7] that are de- 
scribed below. In each considered network model setup it is assumed that all 
edge costs are computed using the same type of edge cost function. 

Edge cost function ONE assigns a value of one to all edges in the lattice C. 
Applying a least cost routing criterion results in minimising the number of hops 
taken by each packet between its source and its destination node, minimum hop 
routing. Since the values assigned to each edge do not change during the course 
of a simulation, this type of routing is called static routing. The QS edge cost 
function assigns to each edge in the lattice C a value proportional to the length 
of the outgoing queue at the node from which the edge originates. A packet 
traversing a network using this edge cost function will travel from its current 
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node to the next node along an edge belonging to a path with the least total 
number of packets in transit between its current location and its destination. 
The edge cost function QSPO assigns a summed value of a constant one plus 
a number proportional to the length of the outgoing queue at the node from 
which the edge originates. This cost function combines the attributes of the first 
two functions. Routing decisions made using QS or QSPO edge cost functions 
rely on the current state of the network simulation. Hence, they imply adaptive 
or dynamic routing in which packets have the ability to avoid congested nodes 
during a network simulation. 

Each switching node maintains a routing table of least path cost estimates 
to reach every other node in the network. This type of routing scheme is called 
full-table routing. When the edge costs are assigned by the function ONE the 
routing tables can be calculated once at the beginning of each simulation and 
they do not require updating, because edge costs do not change with time. In 
this case the cost estimates are the precise least-costs. When the edge costs in 
a PSN model are assigned by the function QS or QSPO, the routing tables are 
updated at each time step using a distributed version of Bellman-Ford least-cost 
algorithm [2]. Since only local information is exchanged at each time step, the 
path costs stored in the routing table are only estimates of the actual least path 
costs across the network. 

Packet Switching Network Model Algorithm. In our model we consider 
time as given in discrete time units of size one and perform simulations from 
time k = 0 to a final simulation time k = T. We can observe the state of the 
network model at the discrete time points k = 0, 1, 2, . . . , T only. At time k = 0, 
the network is initialised with empty queues and the routing tables are computed 
using the centralised Bellman-Forcl least-cost algorithm [2]. 

One step of the discrete time, synchronous and distributed in space network 
algorithm advances the simulation time from k to k + 1. It is given by the 
sequence of five operations: (1) Update routing tables: The routing tables of the 
network are updated in a distributed manner; (2) Create and route packets: At 
each node, independently of the other nodes, a packet is created randomly with 
source load A. Its destination address is randomly selected among all other nodes 
in the network with uniform probability. The newly created packet is placed in 
the incoming queue of its source node. Further, each node, independently of the 
other nodes, takes the packet from the head of its outgoing queue (if there is 
any), determines the next node on a least-cost route to its destination (if there is 
more than one possibility then selects one at random with uniform probability), 
and forwards this packet to this node. Packets, which arrive at a node from other 
nodes during this step of the algorithm, are destroyed immediately if this node is 
their destination, otherwise they are placed in the incoming queue; (3) Process 
incoming queue: At each node, independently of the other nodes, the packets 
in the incoming queue are randomised and inserted at the end of the outgoing 
queue; (4) Evaluate network state: Various statistical data about the state of the 
network at time k are gathered and stored in time series; (5) Update simulation 
time: The time k is incremented to k + 1. 
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The above described time step from k to k + 1 of the network algorithm can 
be repeated an arbitrary number of times. Step (3) of the algorithm simulates 
a sort of processing delay and ensures that a packet, which arrives at a node 
in one time step, does not leave the node before the next time step. Further, 
randomising the incoming queue simulates that the packets arrive at each node 
in random order. 



3 From Individual to Collective Behaviour of Packets in 
Transit 

In this section we present selected simulation results and discuss the emergence 
of collective behaviour in packets traffic for the described adaptive routing al- 
gorithms. We explore how changes in the network connection topology affect 
critical source load values and packets spatio-temporal collective behaviour. For 
a discussion of how other network performance aggregate measures reflect and 
relate to the emergence of packets collective behaviour we refer to [8,9,10,11,12]. 

Critical Source Load. We classify host source loads A as sub-critical , critical, 
and super- critical source loads. A source load is said to be sub-critical if the 
average number of packets entering the network does not exceed the average 
number of packets leaving the network. This results in free flowing traffic in a 
network where packets reach their destinations in a timely manner. A super- 
critical source load is just the opposite of this: the average number of packets 
entering the network exceeds the number of packets exiting the network. The 
result of applying a super-critical source load is a congested network. Finally, 
the critical source load A c is the largest sub-critical source load, or the phase 
transition point from free flowing to congested network states. This phase tran- 
sition has been observed in empirical studies of packet switching networks [13]. 
For details concerning the calculation of A c for a given PSN model and analysis 
how various aspects of a PSN affect the critical source load value see [6,7,8,9,10, 
11 ]- 

We consider how the random addition of links affects the critical source load. 
The results for periodic (left figure) and non-periodic (right figure) square lattices 
of size L = 16 with PF = 0 are displayed in Fig. 1. The lower graphs in both 
figures correspond to edge cost function ONE, and the upper ones, that almost 
coincide, to the edge cost functions QS and QSPO. We notice that for a periodic 
lattice, in the absence of additional links, all three edge cost functions result in 
similar values for A c . For the non-periodic lattice, A c for the edge cost ONE is 
significantly lower than those for QS and QSPO in this case. The reason is that 
the static minimum hop routing does not route packets around congested regions 
of the network. The critical source loads for the three edge costs of the otherwise 
same model setups are significantly lower for the non-periodic lattice without 
or with relatively small number of extra links than those for the equivalent 
periodic lattices. We attribute this to the increased average distance between 
any two nodes in the non-periodic lattice. This implies, that on average many 
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Fig. 1. A c versus l for £;,o(16) with PF = 0. The left figure shows results for periodic 
lattices, the right for non-periodic lattices. The lower graphs correspond to edge cost 
ONE, the upper ones to QS and to QSPO. The graphs for QS and QSPO almost 
coincide 

more packets remain longer in transit resulting in faster build up of congestion for 
lower source load values. Further, many more packets must transit through the 
centre region of the network, in particular for static routing, i.e. edge cost ONE; 
in the case of QSPO, see Figs. 2 and 3. For periodic and non-periodic lattices 
and adaptive routing, i.e. edge cost QS or QSPO, additional links increase the 
value of A c . In fact, for both, QS and QSPO, there is very little difference in the 
values of A c (this may be due to the granularity of the applied source loads) . For 
edge cost ONE (static routing) the addition of a small number of links results in 
a dramatic drop in the critical source loads. This is likely the result of the local 
congestion occurring at the ends of these few additional links as is seen in [12]. 
As more links are added, the critical source loads begin to recover towards their 
original levels. However, for the periodic lattice the number of additional links 
needed to fully regain this value is much higher than the one presented in the 
graph. With regards to the preferentiality factor of attachment of extra links, 
our study showed that an increase in the PF-value has an adverse effect on the 
critical source load values [10,11]. 

Spatio-temporal Packets Traffic Dynamics. Here, we discuss the effects 
of the lattices periodicity and addition of extra links on the collective spatio- 
temporal behaviour of packets in transit in PSN models with edge cost function 
QSPO for connection topology C = £;,d( 16). The presented results are for pref- 
erentiality factor PF = 0 and numbers of additional links l = 0, 1, and 51. For 
the periodic lattices, the source load values A are 0.125, 0.130, and 0.175, respec- 
tively. For the non-periodic lattices they are 0.090, 0.095, and 0.160, respectively. 
The values of the considered source loads are about 0.005 higher than those of 
the corresponding A c values. For a discussion of results for other cost functions, 
topologies and source loads, see [12]. 

In Figs. 2, 3 and 4 the x- and y-axis coordinates denote the positions of 
switching nodes in C = £z,o(16). The z-axis denotes the number of packets in 
the outgoing queue of the node at that (. x , y) position. In Fig. 2 we present results 
for periodic lattices and in Fig. 3 for non-periodic ones. In both figures, the left 
column corresponds to the lattice without an extra link, the right column to the 
lattice with one extra link. In Fig. 4 we show results for lattices with l = 51 
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Fig. 2. Evolution of queue lengths for edge cost QSPO, A > A c , periodic lattice C = 
£(,□(16) without an extra link (left column) and with an extra link (right column), for 
k = 1600 (top row) and k = 3200 (bottom row) 

extra links; in the left picture for the periodic lattice, in the right picture for 
the non-periodic one. The distribution of packets is presented for two separate 
snapshots in time corresponding to k = 1600 and 3200 in Figs. 2 and 3, and in 
Fig. 4 only at k = 3200. 

The graphs of Fig. 2 demonstrate spatio-temporal self-organisation in packets 
traffic for periodic lattices with l = 0 and l = 1 and emergence of wave-like 
pattern with peaks and valleys in packet distribution for congested networks 
(A > A c ) using adaptive routing with the QSPO edge cost function. As the 
number of packets in the network model increases, the wave-like pattern emerges 
in spite the routing scheme attempts to distribute the packets across the network 
queues evenly. One notices that for each node that is part of a peak (i.e. with a 
relatively large number of packets in its queue), its directly attached neighbours 
are part of a trough (i.e. with a relatively small number of packets in their 
queue). The converse also holds true. The addition of an extra link hastens the 
formation of the wave-like pattern. The pattern is similar to the one obtained for 
QS for periodic lattice [12]. However, the addition of many extra links destroys 
the formation of this pattern, see Fig. 4. Large queues of more or less the same 
size appear at the ends of the extra links with very small number of packets 
elsewhere. 
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Fig. 3. Evolution of queue lengths for edge cost QSPO, A > A c , non-periodic lattice 
C = £;,□ (16) without an extra link (left column) and with an extra link (right column), 
for k = 1600 (top row) and k = 3200 (bottom row) 

The graphs of Fig. 3 demonstrate that connection topology periodicity plays 
an important role in building up of congestion and emergence of packets col- 
lective behaviour. For the lattice with l = 0 we observe across the network an 
essentially random distribution of packet queue lengths with their variability 
increasing over time. The landscape of queue sizes resembles the one observed 
in the case of the edge cost ONE [12], with the difference that here the queues 
are significantly larger at the centrally located nodes. That means that, for the 
QSPO edge cost function, the ONE-part of the cost plays a much more signifi- 
cant role in routing of the packets in the case of non-periodic lattices than in the 
case of the periodic ones. We observe that the addition of one extra link dramat- 
ically changes packets collective dynamics. It destroys the increasing build up of 
the queue sizes of large variability in the interior of the network. The extra link 
provides a “short-cut” across the network and plays the surrogate role of the 
missing links due to the lattices lack of periodicity. The extra link attracts much 
larger numbers of packets than links elsewhere in the network resulting in local 
congestion growing at its ends with time. The remaining packets are distributed 
more evenly among the other queues of the network, then as it was observed 
for l = 0. Initially, the queues are located mostly in the centre of the network. 
As time progresses this local congestion spreads out almost evenly across the 
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Fig. 4. Packet queue lengths for k = 3200, edge cost QSPO, A > A c , lattice C = 
£(,□(16) with 51 extra links: left picture for the periodic lattice, right picture for the 
non-periodic lattice 

network and a wave like pattern emerges among these queues. The sizes of these 
queues stay always significantly smaller than that at the ends of the extra link. 
For times k > 1600, the distribution of queue sizes resembles that of the super- 
position of the distribution of packets for the edge cost ONE with that for QS 
presented in [12]. The addition of a large number of extra links destroys wave 
like pattern and packets queue mostly at the ends of the extra links. Fig. 4 shows 
that the variability of the queue sizes is higher for the non-periodic lattices than 
for the periodic ones. 



4 Conclusions 

We described our CA like model of the OSI Network Layer for PSNs. The 
presented simulation results demonstrate that the network connection topol- 
ogy plays an important role for the amount of traffic that can be presented to 
the network and for the emergence of packets in transit collective behaviour. For 
static minimum hop routing, the addition of one extra link decreases drastically 
the critical source load value and in order for the network to recover its perfor- 
mance many links must be added. For adaptive routing, the critical source load 
values increase with the increase in the number of extra links. We observe that 
the addition of one extra link can hasten the spatio-temporal self-organisation 
in the distribution of packets in transit and lead to the emergence of a wave-like 
pattern of queue lengths across the network. However, addition of many links 
destroys this process. 

The presented study is a continuation of the authors work in [12]. We 
plan further investigation into the factors responsible for spatio-temporal self- 
organisation of queue lengths. Discovering and understanding correlations be- 
tween the traffic patterns and network parameters has the potential to impact 
decisions made with regards to topological and routing scheme designs for real- 
world networks. 
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Abstract. In this paper, we evaluate the feasibility of distributed control of 
shared resources in user networks. A user network is totally controlled by the 
users, both at application and transport level. This paradigm has become possible 
with the advent of broadband wireless networking technologies such as IEEE 
802.11. One of the most popular applications in these networks is peer-to-peer 
file exchange. As a consequence, the “external” access to the Internet (set of 
links between the user network and the Internet) is a shared resource that can 
be optimized by node cooperation (i.e., if a node cannot serve its demand with 
its own external link, it requests help from another node via the high-bandwidth 
internal user network). We consider cellular automata to model such networks and 
game theory to model cell behavior. Every cell chooses to behave as a cooperative 
or defector node. Cooperators may assist in file exchange, whereas defectors try 
to get advantage of network resources without providing help in return. Simula- 
tion results help to understand the conditions for cooperative cells to win the game. 

Keywords: Stochastic cellular automata, user networks, complex systems, game 
theory. 



1 Introduction 

User networks are totally controlled by the users, both at application and transport level. 
This paradigm has become possible with the advent of broadband wireless networking 
technologies such as IEEE 802.1 1. For applications such as peer-to-peer hie exchange, 
it may be useful to consider the “external” access to the Internet (set of links between 
the user network and the Internet) as a shared resource that can be optimized by node 
cooperation (i.e., if a node cannot serve its demand with its own external link, it requests 
help from another node via the high-bandwidth internal user network). Nodes decide if 
cooperation requests are granted or not, either by analyzing limited information on their 
neighbors’ states or only based on their own experience. 

In this paper, we analyze the conditions that enable cooperation in real user networks. 
This is not trivial in a realistic scenario. We could impose several conditions leading to 
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node cooperation that would never hold considering user behavior in peer-to-peer (P2P) 
services. Therefore, we have developed a framework that makes it possible to model the 
real problem and to provide results that explain the conditions for cooperative nodes to 
win the game, i.e., to become a majority. 

In order to hide the complexities of file exchange, we model it as a game where 
cells (i.e. node routers) choose between two different basic strategies: cooperation and 
defection. Cooperators assist neighbor cells in Internet file exchange, while defectors do 
not offer help to neighbors (although they request help for their demands). Thus, defec- 
tors model non-cooperating users, a typical problem in P2P networks. If we consider 
the spatial structure of the population, restricting interaction to local neighborhood, a 
stable spatial coexistence of cooperators and defectors becomes possible under certain 
conditions. 

The context of cooperative games and cooperation evolution has been extensively 
studied in biological, social and ecological contexts [11], seeking general theoretical 
frameworks like the Prisoners’ Dilemma (PD). In his seminal work, Axelrod has shown 
that cooperation can emerge in a society of individuals with selfish motivations [10]. For 
a review of related work in the last twenty years see [12]. 

The paper is organized as follows: section 2 describes user-managed networks. Sec- 
tion 3 briefly introduces some basic concepts of game theory. Section 4 defines the 
Cellular Automaton, which is the basic model for the analysis in section 5. Section 6 
discusses the results of our simulations. Section 7 presents the conclusions of the paper 
and suggests future research. 



2 User-Managed Networks 

User-managed networks have become possible with the advent of wireless technologies 
such as IEEE 802.1 1 [2], They represent one of the last stages in network control evo- 
lution [4]. This evolution started with telephone networks. In them, Telcos controlled 
both transport and applications. Later, the Internet allowed users to control applications. 
Although Telcos still control transport in most of the Internet, in some scenarios carrier 
technology is affordable to end-users, and user-managed “islands” appear in a natural 
way [5,6,7]. For example, this kind of infrastructures is currently being used to provide 
broadband access in Spanish rural areas, as an expansion of shared asymmetric DVB-S 
or DVB-T gateways. 

A typical basic node in a wireless user-managed network is composed by a router, 
an IEEE 802. 1 1 access point (AP) and/or some IEEE 802. 1 1 cards to set links with other 
basic nodes. Basic nodes may also be linked to a multi-user LAN (covering a building, 
for example). 

Figure 1 shows a sector of Vigo (Spain) with 46 basic node locations and the corre- 
sponding wireless links for near-maximum overall network bandwidth (D-Link DWL- 
1000 APs, D-Link DWL-650 wireless cards and AP channel cellular planning as in [13], 
for 50 x 50 m 2 cells). 

A subset of the basic nodes will have cable or DSL access, providing “external” 
connection to the Internet. For the purposes of this paper and without loss of generality, 
we will assume that all basic nodes are “externally connected”. Additionally, we assume 
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Fig. 1 . Network sample: white squares denote isolated nodes 



that user network capacity is much larger than external capacity (this holds for reasonable 
internal and external networking technologies, for example IEEE 802.11b and DSL 
respectively). Nowadays, peer-to-peer file exchange is a “killer” home application [8]. In 
a user network, basic nodes can easily share contents, due to the large internal bandwidth. 
The bottleneck is the set of “external” connections to the Internet. By optimizing their 
usage, overall performance (and, as a consequence, user satisfaction) can be greatly 
improved. 

By network stability we refer to the condition such that external demands (e.g., 
downloads or uploads at the edges of the user network for peer-to-peer file exchanges) 
can be satisfied with external capacity, on average. This certainly holds if: 

1. The external demand of each basic node can be satisfied with its own external 
capacity, on average, 

2. All basic nodes cooperate via the user network and their combined external demand 
can be satisfied with their combined external capacity, on average. 

Even if the first condition holds (and therefore cooperation is not strictly necessary 
to guarantee network stability), cooperation minimizes demand service time (nodes with 
temporarily idle external connections can help neighbors with demand peaks). However, 
there is no central authority, and -probably selfish- nodes will act either from limited 
information on their neighbors’ state or only based on their own experience. As explained 
in section 5, cooperation granting/denial with limited information can be modeled as a 
game. The main goals of this paper are ( i ) to demonstrate that cooperative nodes may 
become a majority in user networks (cooperation may minimize service time) and ( ii ) 
game players can be implemented as distributed basic node agents. In our terminology, 
an agent is the program that implements the cell behavior. 



3 Game Theory 

Game theory [1] provides useful mathematical tools to understand the possible strate- 
gies that utility-maximizing agents may follow when choosing a course of action. The 
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simplest type of game is the single-shot simultaneous-move game. In it, all agents must 
choose one action and all actions are effectively simultaneous. Each agent receives a util- 
ity that is a function of the combined set of actions. In an extended-form game, agents 
participate in turns and receive a payoff at the end of a sequence of actions. A single-shot 
game is a good model for many distributed systems, in which encounters require co- 
ordination. Finally, many scenarios that seem correctly described by an extended-form 
game can actually be modeled as a series of single-shot games. 

The approach we follow in this paper is a composite game where actions are effec- 
tively simultaneous but every agent may interact with several neighbors at a time. Every 
agent receives a data throughput payoff every turn. The better the strategy in its context, 
the better the payoff, i.e., the higher the throughput. After a predefined set of turns d, 
each agent i chooses a strategy s,; £ Si, where Si is the set of all basic strategies for 
agent i. The strategy chosen will be used to interoperate with its neighbors for the next d 
turns. In our case, we restrict the set of strategies to two basic options: defection Sj = 0 
or cooperation Si = 1. Note that the payoff of a given agent depends on the choices 
made by the rest. 

4 Defining the Cellular Automaton (CA) 

In this section we introduce a model for a cellular automaton (CA) taken from [9], to 
describe the different nodes of the network ( cells in the sequel). 

In order to specify cell interaction, we adopt a game theoretical approach. Each 
cell i has two alternative strategies to act in a given situation, defection (s, = 0), and 
cooperation (s, = 1). Then, Si £ S indicates whether agent i is a cooperator or a 
defector. We assume that all agents share the same set of strategies and that the total 
number of agents N is constant. 

For the spatial distribution of the agents we consider a two-dimensional cellular 
automaton (or a two-dimensional square lattice) consisting of N cells. Each cell is 
occupied by one agent. Therefore, each cell is characterized by a discrete value Si £ 
{0, 1}, indicating whether it is occupied by a defector or a cooperator. The spatiotemporal 
distribution of the agents is then described by $ = {si, .S' 2 , s^}. 

Figure 2 shows a cell i and its neighborhood. In the model in section 5, every cell 
(i.e. agent) in the square lattice will interact with I\ closest neighbors to handle external 
traffic. Thus, there are both local and spatial interactions between neighbor cells. 

5 Modeling the Problem 

If we let every agent in the system to interact with the remaining N — 1 agents, we 
have a panmictic population. However, in this paper we are mainly interested in the 
spatial effects of the game. Besides, in real user networks each node interacts with a 
few neighbors. Therefore, assuming we simplify the typical "mesh-like” user network 
topology as a pure mesh, we consider that each agent i only interacts with the K agents 
in its immediate neighborhood. 

Interaction is driven by demand service times of externally inbound or outbound 
data, i.e,, when the external service queue of a cell is larger than a particular threshold, 
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Fig. 2. A two-dimensional cellular automaton. Cell i and two possible neighborhoods. 



the agent controlling the cell contacts its neighbors requesting help to handle the files 
involved. In order to introduce a time scale, the time unit to generate new traffic demands 
and for neighbors interaction is an hour. The total number of interactions per hour can 
be K x N or less. 

We decided that, during a 24-hour timeframe (a day), the strategy s,; of an agent does 
not change. We also decided to consider a day as the time unit for strategy changes, since 
traffic patterns are similar in the same hour across different days. 

After a day passes, ■s l can change as described next for two disjoint scenarios: 

- Oracle-based or centralized scenario. This scenario is unrealistic, because it as- 
sumes that every agent knows the strategy of its K neighbors. It can be considered 
an ideal reference for the distributed scenario below. The agent in cell i mimics 
the strategy of its “best” neighbor k whose agent offered to i the best worst-case 
hourly throughput during last day (out of 24 measurements). We decided to compare 
hourly rather than daily throughput to let user satisfaction be based on a compromise 
between worst-case and average experience. 

The main interest of this scenario is to determine if cooperation is feasible or not. 
For this purpose, we consider the best conditions, i.e., when cells share information. 
Therefore, it allows us to predict the most popular strategy in a game with shared 
information. 

At the end of every day d, agent i calculates a;=argmax a;g ^ (min^ e ^(th(/i, x)), where 
th (h, x ) is the throughput of neighbor x during hour h. Then, agent i mimics the 
strategy of agent x. Obviously, it keeps its previous behavior if the most successful 
agent followed the same strategy. 

- Realistic or distributed scenario. In this scenario, each agent is a stochastic au- 
tomaton with two states: cooperator or defector. Each agent compares its worst-case 
hourly throughput during the last day (out of 24 measurements) with the average 
values obtained in previous history by the alternative strategy s*. If its through- 
put is better than the average value achieved by the alternative strategy, then the 
agent linearly increases the probability of the current strategy s,; (we use the linear 
reinforcement scheme for stochastic automata in [3]). 

At the end of every day d, agent i calculates f(i, <i)=min/ lg d(th(ft,, j)). Then: 
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• If f(i, d) is strictly higher than the daily average values of / up to that moment 
when following the alternative strategy sj, then: 
p ai [d+ 1] = p Si [d\ + 0.1(1 -p Si [d\), and ps-[d+ 1] = 0.9 Psj[d\. 
the strategy for the next day is selected according to probabilities p Si (d + 1) and 
Psi[d+ 1]. 

We assume that, each hour, if the queue length of a cell exceeds a defined threshold, 
the agent ruling that cell requests help from its neighbors for every pending file (incoming 
or outgoing). Neighbor agents may grant their external connections or not, depending 
on their present state and strategy. We implement help transactions using the Contract 
Net Protocol: neighbors answer with a set of offers and/or refusals. The requesting agent 
selects the offering cell that provided the best average throughput in the past. 

We model defection and cooperation as follows: 

- Defection', a defector never helps, so it will never grant its external connection. 
Nevertheless, defectors ask their neighbors for help when they need it. Thus, they 
use shared resources opportunistically as many users do in P2P networks. 

- Cooperation', a cooperator helps its neighbors up to a certain credit limit (denoted 
by CreditLimit in section 6). If cell j reaches its credit limit, cell i does not help 
j again unless j returns its debt by helping i when requested. 

Finally, concerning cell demand distribution, we define two cell types A and B. 
A cells generate uniformly distributed demand service times (externally inbound or 
outbound data) during disjoint busy and quiet periods, corresponding to day and night. 
In B cells, the quiet period takes place during the busy period of A cells, and viceversa. 



6 Simulation Results 

Diverse interesting characteristics of the spatial game appeared in the simulations. Start- 
ing with an initial random distribution of cooperators and defectors, several spatial 
domains appear, each of them dominated either by cooperators or defectors. Also, we 
noticed that, in the long run, the percentages of cooperators and defectors become sta- 
tionary (with minimal changes). 

We performed all the simulations with QNet 1 . As previously said, all case studies 
take place in a completely symmetric user network. Each cell is linked to K neighbors 
(we have considered I\ = 4 and K = 8). The relevant settings for our application are: 

- There are (50 x 50 = 2,500) cells in the user network. 

- Demand distribution settings are: 

• Busy period', the cell generates uniformly distributed demand service times with 
mean 8 units during 12 hours a day. 

• Quiet period: idem, with mean 1 unit during the remaining 12 hours. 

1 QNet (Quarter Net) is a Java-based cellular automata simulator that can be requested to J.C. 

Burguillo, {jrial@det.uvigo.es} 
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Cooperators (%) 




Fig. 3. Percentage of cooperators after 2400 hours, centralized scenario 



- External links have a bandwidth of 5 demand units per hour. Note that this is slightly 
above the average demand (0.5x8 + 0.5xl = 4.5), and therefore external capacity 
is loaded. For example, for 1 Mbps DSL each demand unit is ~100 MB. 

- Cell demand files have a maximum size of two demand units (e.g. 200 MB for 1 
Mbps DSL). 

- At time origin, node strategies are assigned with equal probability. 

- At time origin, cell traffic patterns ( A or B) are assigned with equal probability. 

- Every cell (including defectors) asks its neighbors for help if it generates over 5 
demand units per hour. 

Initially, we ran our simulations with a single cell type (A or B). In this case, defectors 
became a majority. We could have expected this result, since all cells are typically busy 
at the same time of the day and, as a consequence, cooperation is not rewarding. 

When we considered two cell types ( A and B ), we observed the following: 

- Let C and D be the percentages of cooperators and defectors at simulation end, 
respectively. It happens that C » D in permanent state if cooperators choose 
CreditLimit wisely. The larger the value of C, the larger the global user network 
throughput. Roughly, this holds for CreditLimit^ 10 in all cases. Figures 3 and 
4 show the percentage of cooperating cells after 2400 simulated hours, in the cen- 
tralized and distributed scenarios respectively. Cooperation success as a function 
of CreditLimit suggests that, in an agent-based implementation, agents should 
adjust CreditLimit dynamically, according to the queue lengths in their cells. 

- Although cooperators and defectors are completely mixed at simulation start, co- 
operators tend to form confederations. This spatial distribution tends to become 
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Cooperators (%) 




Fig. 4. Percentage of cooperators after 2400 hours, distributed scenario 




Fig. 5. Spatial distribution (K=4, CreditLimit=7.5, 100 days, centralized scenario) 



stable. Figure 5 shows a spatial distribution of the centralized scenario with K = 4 
and CreditLimit=7.5 after 2400 hours (dark cells represent defectors). Figure 6 
shows the evolution of the percentages of cooperators and defectors in two scenarios: 
centralized (left side) and distributed (right side). 

- The choice of CreditLimit is specially critical in the centralized scenario: if co- 
operators are too conservative (low CreditLimit) or too generous (high Credit- 
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Fig. 6. Percentages of Cooperators vs Defectors (K=4, CreditLimit=7.5). a) Left-side: central- 
ized scenario, b) Right-side: distributed scenario. Cooperators win in both cases. 



Limit), cooperation is less successful than defection. However, in the distributed 
scenario, this effect is only evident if cooperators are conservative. A possible expla- 
nation for the result in the centralized case is that, for CreditLimit values around 
10, defectors only can fool cooperators at the beginning of the game, since debts 
have to be promptly returned. If cooperators are too generous, defectors fool them 
completely. On the other hand, if cooperators are too conservative, their behavior 
approaches that of defectors. 

- The peak of cooperation success is lower in the distributed case. This is logical, since 
agents do not share information. However, it is surprising to observe the relatively 
high percentage of cooperators for high CreditLimit values in the distributed 
case, whereas in the centralized case it may be as low as 2% in the same region. 
We do not have an explanation for this interesting effect yet. In any case, we show 
that even the simple stochastic automata in the distributed scenario gets close to 
the cooperation peak in the centralized scenario. Thus, there is plenty of room for 
further improvements. 



7 Conclusions and Future Work 

The game presented in this paper is an abstraction of the complex problem of negotiation 
and resource sharing in user networks. We consider this abstraction reflects the main fea- 
tures of the user network, and we show that under certain conditions (like CreditLimit 
settings) cooperation is the winning strategy, even in case of autonomous node opera- 
tion. We show that the simple stochastic automata in the distributed scenario gets close 
to the cooperation peak in the centralized one. This fact encourages us to study more 
powerful decision algorithms. For example, agents could infer queue occupation peaks 
and strategies of neighbor cells from local records of past interaction. 
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We have observed that a user network can obtain benefits from node cooperation if 
traffic demands are strongly variable during the day, for example in case of several node 
types with complementary activity periods. 

Now that cooperation can be considered a successful strategy in user networks, we 
plan to adapt the simulator to include real network topologies and to explore more 
complex strategies. 



References 

1. Ken Binmore: Game Theory. Me Grow Hill, 1994. 

2. IEEE 802.1 1. [Online]. Available:http://grouper.ieee.org/groups/802/l 1/. 

3. Narendra: Learning Automata. 1989. 

4. J. P. Hubaux, T. Gross, J. Y. L. Boudec, M. Vetterli: Towards self-organized mobile ad-hoc 
networks: the terminodes project. IEEE Commun. Mag., no. 1, pp. 118-124, 2001. 

5. Madrid Wireless. [Online]. 2004. Available at: http://madridwireless.net. 

6. Wireless Athens Group. [Online], 2004. Available at: http://www.nmi.uga.edu/research. 

7. N. Negroponte: Being Wireless. Wired Magazine, 10.10. Oct. 2002. 

8. Kazaa news. [Online]. 2004. Available at: http://www.kazaa.com/us/news/index.htm. 

9. F. Schweitzer, L. Behera, H. Muhlenbein: Evolution of Cooperation in a Spatial Prisioner’s 
Dilemma, Advances in Complex Systems, vol. 5 (2-3), pp. 269-299. 2003. 

10. R. Axelrod: The evolution of Cooperation. Basic Books, New York, 1984. 

11. F. Schweitzer, J. Zimmermann, H. Muhlenbein: Coordination of decisions in a spatial agent 
model. Physica A, 303(1-2), pp. 189-216, 2002. 

12. R. Hoffmann: Twenty years on: The evolution of cooperation revisited. Journal of Artificial 
Societies and Social Simulation, 3(2), 2000. 

13. F. Box: A heuristic technique for assigning frequencies to mobile radio nets, IEEE Trans. Veh. 
TechnoL, vol. VT-27, pp. 57-74, 1978. 




Lattice Boltzmann Modeling of Injection 
Moulding Process 



Jonas Latt 1 , Guy Courbebaisse 2 , Bastien Chopard 1 , and Jean Luc Falcone 1 

1 University of Geneva, Computer Science Department, 1211 Geneva 4, Switzerland, 
Bastien. ChopardOcui .unige . ch, 
http : // cui . unige . ch/ ~ chopard/home . html 
2 LRPP - ERT 10 - ESP, 85 Rue Henri Becquerel 
F-01100 Bellignat, France 
courbebaisseOf ree . f r 



Abstract. Polymer injection in moulds with complicated shapes is 
common in todays industrial problems. A challenge is to optimize the 
mould design which leads to the most homogeneous filling. Current 
commercial softwares are able to simulate the process only partially. 
This paper proposes a preliminary study of the capability of a two-fluid 
Lattice-Boltzmann model to provide a simple and flexible approach 
which can be easily parallelized, will include correct contact angles and 
simulate the effect of the positioning of air-holes. 

Keywords: Polymer injection moulding, Hele-Shaw model, mathemati- 
cal morphology, lattice Boltzmann method, multi-component fluids, nu- 
merical simulation. 



1 Introduction 

In the injection moulding process, as in many other scientific problems, numerical 
simulations play an important role. Polymer injection pressure, injection points 
and locations of air evacuation channels all affect the quality of the final object. 
In particular, the re-attachment of two fronts of matter (weld lines) leads to 
strong internal constraints and weakens its mechanical resistance. 

Commercial softwares, such as MoldFlow, offer powerful tools to simulate the 
moulding process in the free surface approximation. They provide a quantitative 
description in which polymer properties can be adjusted. However, they offer 
little flexibility to combine the simulation with a shape optimization procedure. 
In addition, the free surface approximation does not allow the designer to study 
the effect of the location of air evacuation points. Finally, parallelized versions 
of such softwares are often not available. 

There is a need among the practitioners of this field to develop new ap- 
proaches. In particular, there seems to be a lack of tools offering a pre-modeling 
of the injection process, with enough flexibility to investigate the impact of ge- 
ometrical factors. There are several examples in which the mould geometry is 
dominating in comparison with the fluid rheological properties. For instance, 
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a recent original method based solely on the propagation of distances demon- 
strates that the injection process is more driven by the geometry of the mould 
cavity than by the non-Newtonian features of the polymer. In other words, the 
crucial hydrodynamic properties are mass conservation, forbidden regions and 
behaviour at the mould boundaries. 

Along this line, the use of mathematical morphology concepts [1,2] for the 
simulation of the propagation of matter in a cavity, with one or more points 
of injection [3], is quite promising. The work by G. Aronsson [4] provides an 
explicit link between the concept of propagation of distances and the spreading 
of fluids with power-law stress tensors. 

The goal of this paper is to exploit the flexibility and simplicity of the Lattice 
Boltzmann (LB) method [5] to address the mould injection process at a pre- 
model level. We consider the so-called Slran-Chen model [6] of two immiscible 
fluids representing the polymer and air respectively. We assume that Newtonian 
fluids will provide good approximations if geometrical factors are dominant. In 
this preliminary study we shall not pay too much attention to the actual ratio 
of densities and viscosities of the two fluids. 

We will first test our approach on the so-called Hele-Shaw situation (see for 
instance [7]) and then consider a more realistic mould cavity. The LB results 
are compared with MoldFlow simulations. A discussion of the critical features 
of our approach is finally given. 

Note that, at the time of writing, we became aware of a similar work by 
Ginzburg [8] in which a free-surface LB model is developed and applied to the 
filling of a cavity. However, this model does not include air behaviour and does 
not take into account the effect of air evacuation points. 

2 The Lattice Boltzmann Model 

Lattice Boltzmann (LB) models are historically derived from cellular automata 
fluid models [9,5]. These models are seen as an alternative to finite element 
techniques for fluid flow computations. They have been successfully applied to 
multi-phase and multi-component flows and several other applications. In the 
present context, we propose to adapt the LB methods to the filling phase of an 
injection moulding process. 

2.1 The Lattice Boltzmann Equation for a Single Fluid 

A LB model is built on a discrete space-time universe. The fluid is described 
by density distributions function fk(x,t), k = 0 ... 2 containing the amount of 
fluid that enters lattice node x at time t, with velocity Vk,k = 0 . . . 2 . The 
possible velocities are constrained by the lattice topology. Typically, 2 is the 
lattice coordination number and the discrete set of velocities is chosen so that, 
in one time step At, the particles travel one lattice spacing Ax in any of the 2 
lattice directions. One also considers a rest population of fluid particles fo, with 
velocity uq = 0. 
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The usual physical quantities such as local density and velocity are obtained 
from the fk by the following relations: 

2 Z 

P=^2 -fk and u = ^2 fkVk • (1) 

fc=0 k - 0 



During time evolution, the fluid density functions are streamed on the lattice, 
in the direction specified by k. After the streaming, a collision occurs between all 
the fk entering the same lattice site. The result of this collision is to redistribute 
mass and momentum to each lattice directions, thus creating the post collision 
values of the fk- It can be shown that, provided that the collision operator is 
properly chosen, the LB dynamics solves the Navier-Stokes equation [9,5], in the 
low Mach number regime. 

It is also shown that the fluid pressure is related to the fluid density by the 
ideal gas state equation p = c 2 s p , where c 2 s is the speed of sound. Therefore, in 
a LB model, there is no need to solve the pressure equation. It is all built-in in 
the equation of motion for the fk ■ 

In what follows, we consider a two-dimensional system, with z = 8. This 
corresponds to the so-called D2Q9 model [5]. The collision is obtained by a 
relaxation with coefficient ui to a truncated Maxwell-Boltzmann local equilibrium 
distribution function f^ eq \ which depends only on the current local fluid density 
p and velocity u. This approach is referred to as a LBGK model [9,5]. 

The following equation summarizes the evolution rule: 

fi(x + Vi At , t + At) - fi(x , t) = -u ( fi - f - eq) ) + 7 F mt ■ Vi (2) 

where F lnt is a term which may account for extra interaction force acting on 
the particle density distribution. The coefficient 7 is a normalization parameter 
which depends on the chosen lattice topology. 

In order to model a system of two immiscible fluids (polymer and air) we 
use the so-called Slran-Chen model [6]. Two sets of fk are defined, one for each 
species. The values corresponding to the two fluids will be distinguished by a 
supperscript a £ 0,1. Each fluid follows the dynamics of a single fluid system, 
as given in eq 2. There are only two changes that must be made to describe the 
interaction between the fluids. 

First, the fluids define a common velocity and use it for the computation of 
the equilibrium distribution in equation 2 



'LL = U (°) 



tz« = 



E 



(3) 



Second, the term F lnt is computed so as to produce a repulsive interaction 
force between the two fluids. It depends on a parameter W that accounts for the 
surface tension of the fluids: 



F znl (*, *) = Wp M ' f-(<T) ( X + TV k) V k- 
k 



(4) 
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In the bulk of the fluid, the quantity corresponds to the density of the other 
fluid: 

( 5 ) 

When one models immiscible fluids, one chooses an interaction force that is 
sufficiently strong to keep the penetration between the fluids low. The simulated 
flow field is thus separated in two regions, in each of which one fluid is dominant. 
In a small range around the interface between those two regions, both fluids 
coexist symmetrically. In this range, the density variations are large and the 
approximation for an incompressible fluid is not valid any more. The results 
must rather be understood as representative of some kind of interface dynamics 
between the immiscible fluids. 

3 Boundary Conditions 

On the boundaries of the domain, a specific dynamics is applied on the fluid 
density distributions ff in order to produce the desired effect: no-slip on a wall, 
fixed air pressure on the air-holes and specific contact angle at the interface 
between air, polymer and solid walls. Finally, polymer injection points also need 
a special treatment. The boundary conditions we propose are discussed below. 



3.1 Bounce Back Boundaries 

We implement no-slip boundaries (i.e. boundaries on which the velocity is zero) 
by assigning bounce back dynamics to the boundary nodes. Thus, the dynam- 
ics (2) is replaced by a reflection rule: 

ff\x,t + At) = (x,t). (6) 

The index opposite Of (i) is the index associated with the direction opposite to 
direction i. v 0 p ,p 0 s t t^of ( % j — Vj ■ 

The flow sites next to the boundaries interact with the boundary sites through 
a wetting parameter gO ) . The value of from equation 4 is thus defined on the 
boundary nodes by 

pO) = g O) ( 7 ) 

The value of g^ a> determines the contact angle of the fluid interface on the 
wall [10]. In our simulations, we have chosen g ^ = —g < ' 1 ~ a \ The situation is 
explained on Figure 1. On this figure, fluid 1 is wetting, and fluid 0 is non-wetting. 
This effect is obtained by clrosing > 0 and < 0. 

3.2 Inlet and Outlet Boundaries 

As a boundary condition for the injection spot and the air-holes, we define semi- 
permeable walls: one fluid is retained by a normal no-slip condition, while the 
other fluid is injected or evacuated. 




Lattice Boltzmann Modeling of Injection Moulding Process 349 



Fluid 0 \ Fluid 1 



F 



( 0 ) 

wall 



F 

I 



( 1 ) 

wall 



vzzzzzzzz&zzzm 



Fig. 1. Bounce back boundaries with a defined contact angle. 



Let us consider as an example the case of fluid 0 being evacuated through the 
semipermeable wall. We chose to perform the evacuation at a constant pressure. 
This is achieved by applying on fluid 0 the (constant pressure) outlet boundary 
condition described in the reference [11]. Furthermore, an interaction term = 
p(°) is defined on the wall. This means that fluid 1 acts like a bulk node (5), 
rather than a wall node (6). 

For fluid 1 we implement a (no-slip) bounce-back dynamics on the wall. 
Consistently with the presence of this semi-permeable wall, a wetting parameter 
if-(o) _ g(0) j g defined. 

4 Simulation of an Injection in Free Space 

A first validation of the numerical model is obtained from a simple simulation 
with cylindrical symmetry. A polymer is injected at a spot in free space and 
expands in the form of a disk of increasing radius R. This is the well known 
Hele-Slraw model [7]. In this case, the boundary conditions have a subsidiary 
importance. We can therefore verify whether the basic properties of the polymer 
are properly resolved by the model. 



4.1 Assumptions About Material Properties 

In the particular case of injection moulding, we consider only very high viscous 
fluids. In consequence, the inertial and gravitational terms will not be taken into 
account in the momentum equation. This simplification leads to a well known 
equation : the Stokes equation (see for instance [7]). During the filling phase, the 
polymer is assumed to be incompressible. The resulting equations are : 

S7.v = 0 (8) 

— Vp + V .g_ n = 0 (9) 



where v is the fluid velocity, p the pressure and q_ n the viscous strain tensor. 
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4.2 Hele-Shaw Assumptions 



The Hele-Shaw model describes the filling of a mould cavity whose thickness can 
be neglected. The filling fluid flow is modelled according to the Stokes equation 
between two plates separated by a short distance h. 

We consider a cartesian coordinate system where the 2 -axis is in the thickness 
direction. The x — y plane is on the mid plane of the cavity and the velocity 
components u x ,u y and u z are respectively taken in the x, y and 2 directions. 
The Hele-Shaw flow model consists of the following assumptions [12]: 

— The 2 -component of the velocity is neglected with respect to the other ve- 
locity components. 

— The pressure is constant across the thickness. It only depends on x and y. 

— The velocity gradient in the x and y directions is independent of the 2 
direction. 



Applying the Hele-Shaw assumptions to the Stokes equation, one obtains the 
simplified expressions: 



1 dp d / du x \ dp d / du y \ dp 
dx = dz ylh ) ’ dy t|a| ) ’ dz 



(10) 



with ?y the viscosity. 

These equations can be solved explicitly in the case of a disk mould with a 
constant injection rate Q [7]. 

Inside the injected polymer, the distribution of the pressure is predicted as 
follows: 

p{r) = a + b In r (11) 

where r < R is the distance to the injection point and a and b are constant. The 
distribution of the velocity is given by 

p(r)v(r) = (12) 



Finally, the expression for the surface growth of the polymer follows directly 
from the mass conservation in an incompressible fluid: 



nR 2 = Qt 



(13) 



4.3 Numerical Result 

The geometry of the simulation is a square domain of size 201 x 201, initially 
filled with air (fluid 1) at a density (pressure) of 1.0. At the boundary of the 
domain, the pressure of fluid 1 is kept constant at this value. 

Fluid 1 (the polymer) is injected in the middle of the system, on one lattice 
site, with fixed density (pressure) p\ = 1.1. The two fluids have same viscosity 
(relaxation coeficient w = 1) and their mutual interaction is defined through 
W = -0.3. 
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t = 4000 t = 40000 t = 80000 t = 140000 



Fig. 2. Amount of polymer injected as a function of time, in a free space. In the later 
stage, the anisotropy is due to the air which cannot escape fast enough out of the 
boundaries. 



The evolution of the system is represented at four chosen time steps in Fig- 
ure 2. We expect from equation 13 that the radius of the injected disk increases 
as a square root of the time. This fact is verified on the simulations, both from 
the program MoldFlow and from the lattice Boltzmann model (see Figure 3). 

We have measured the velocity and pressure profile in the simulation at the 
time step t = 140000. The results inside a radius of r < 30 are shown in Figure 4. 
They exclude the domain close to the free boundary that obeys special interface 
dynamics. Note that the velocity v is the modulus of the velocity (■ u x ,u y ). 

In agreement with the predictions of the Hele Shaw theory (equations 11 
and 12), the datapoints are fitted with a logarithmic law for the density and an 
r -1 law for the velocity. The value of Q used to fit the data is the one that is 
measured in the simulation: our injection boundary condition at a given pressure 
(density), fills the mould with a constant flow Q. 

The above results show that Hele-Shaw model is correctly obeyed by the LB 
simulation. 



5 Simulation of a Realistic Moulding Problem 

5.1 Overview 

As a second sample application, we consider the moulding of a charge card with 
integrated chip. In this case, the space taken by the chip is excluded from the 
moulding domain. 

The aim of this application is to show that even somewhat complex geome- 
tries can be addressed with the simulations. In general, the implementation of ap- 
propriate boundary conditions is a very delicate topic. However, fairly appealing 
results are obtained already with the simple boundaries we have implemented. 

5.2 Results 

In the simulation, the surface of the charge card is initially filled with air (at a 
density po = 1.0). The polymer is injected on the side opposite to the chip, at a 
level corresponding to the middle of the chip location. 
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Fig. 3. Evolution of the disk radius during an injection in free space. On the left hand 
side (a), the results from the program MoldFlow are presented, on the right hand side 
(b) the results from a lattice Boltzmann simulation. On the upper part of the pictures, 
the surface of the injected polymer is plotted at successive timesteps. On the lower 
part, the curves for the time evolution of the radius are fitted with the predictions of 
the Hele-Shaw theory. 




Fig. 4. Velocity profile (a) and pressure profile (b) at t = 140000 obtained from the 
lattice Boltzmann simulation. The solid line shows a fit with the predictions of the Hele 
Shaw theory. 
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Fig. 5. Mould of a charge card. The curves show the propagation front of the polymer 
as computed from the program MoldFlow (a) and the lattice Boltzmann method (b,c). 
In (b), a bounce-back condition is applied on the surface of the chip, whereas in (c), 
the air is evacuated at constant density on the chip perimeter. 



The injection is made at a density pi n = 1.01. The air is evacuated on the 
right boundary of the card at a constant density p ou t = 1.0. All other boundaries 
implement a bounce-back condition with g ^ - 1 = 0, which produces a 

contact angle of 7t/2. The surface tension is chosen at a value W = —0.3. 

Figure 5 plots the area occupied by the polymer at successive timesteps, first 
calculated by the commercial program MoldFlow and, secondly, calculated by 
the LB method. The difference betweend (b) and (c) consists in the way the 
boundary conditions are defined on the chip perimeter (see caption) . The results 
of both simulations are very similar in the beginning. In the second part of the 
simulation, the LB method turns out to be more sensitive to the geometry of 
the domain than the MoldFlow simulation. 

The sensitivity of the flow dynamics to the geometry of the domain that we 
observe in the LB simulation is due to the pressure of the air on the polymer. 
Indeed, one of the main advantages of the LB method, the ability to model the 
influence of the air on the dynamics, also introduces some complications. If, as 
in our simulations, the air is not evacuated fast enough, this can have a serious 
impact of the flow dynamics; the resulting flow geometry may disagree with real 
moulding applications, where the air is much lighter and less visous than the 
polymer. In this context, some work still needs to be done either on the flow 
properties of the air or on the boundary conditions. 

6 Conclusion 

The LB methods brings several interesting elements in the process of injection 
modeling. The possibility to consider the full system of two fluids (air plus poly- 
mers) and to include air traps makes it possible to model how air escapes from 
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the mould as the polymer expands. This feature is not offered by current soft- 
wares. In this respect, our preliminary results are relevant. They are especially 
encouraging in the case of the charge card with a complex geometrical shape. 

One assumption of our preliminary study was that both fluids have the same 
viscosity and density. The simulation shows that although this assumption is 
sufficient for explaining some geometric properties of the moulded polymer, it is 
too strong to be able to represent dynamic features such as the detailed polymer 
front propagation profile. Therefore, future work might include a more realistic 
model for the different physical properties of the polymer and the air. Our study 
must be continued in order to give a reliable solution for polymer injection 
moulding and to include optimization processes where the best injection points 
and air-holes are determined to minimize weld lines formation. 

In conclusion, we think that Lattice Boltzmann methods are promising can- 
didates for the simulation of polymer injection moulding and for the solution of 
current problems such as melt polymer/mould wall interaction. 
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Abstract. A mathematical crystallization model in the meso scale (the interme- 
diate dimension scale between interatomic distance in solids and grain size in 
metals and alloys) is presented with the use of a kinetic-diffusion cellular 
automaton model. The proposed model differs from other models by including 
the diffusion of elements and heat conductivity effects on the physical phenom- 
ena at the solid-liquid interface. The model considers the non-equilibrium char- 
acter of real processes of phase transformation, where the kinetic undercooling 
of the solid-liquid interface is a measure of this non-equilibrium level. The ani- 
sotropy of interface mobility is assumed. The growth of individual dendrites 
was simulated for different initial conditions and growth parameters. The mod- 
elling results are compared to the experimental data. 



1 Introduction 

Although analytical solutions of dendritic growth tasks are very important, they are 
only applicable to very simple tasks [1], They permit the determination of basic de- 
pendencies and rules typical of this phenomenon, for instance the tip-splitting effect 
[2] or morphology diagram of structure patterns in the parameter space of undercool- 
ing - surface tension anisotropy [3], Analytical solutions may also be treated as 
benchmarks for the verification of numerical solutions. 

Numerical simulation is currently used for analysing several specific issues. Most 
importantly these include the use of phase-field or cellular automation (CA) methods 
that not only permit the analysis of the steady state but also that of the non-steady 
state of the solidification processes [4]. 

The phase-field modelling method [5] permits the precise determination of many 
parameters related to the creation of a microstructure, e.g. the kinetics of the growth 
of the dendritic front, the preferred growth orientation, etc. Unfortunately, due to the 
model physics involved, it is necessary to apply a computation grid step that is 
smaller than the width of the diffuse interface layer [4] which amounts to several in- 
teratomic distances. Despite the use of modern supercomputers, advanced numerical 
solutions and the multiplication of the diffuse interface width by the arbitrary multi- 
plier of the order of 10-100 [4], the current phase-field model may only be treated as a 
scientific research instrument to study the solidification phenomenon in the area of 
one or several dendrites and it cannot be used to model the casting structure. 
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The order of magnitude of the computation grid step applied in CA technology is 
between the interatomic distance and the linear size of the microstructure elements. 
This significantly reduces the modelling time in comparison with the phase-field 
method and gives an opportunity to apply CA to multiscale-type models in order to 
model the casting structure. 

Already the first attempts to use CA technology in order to model the solidification 
process showed that this model may be used for analysing dendritic growth [6-8], 

In the nineties of past century, the application of a technique of cellular automaton 
was gaining a wide-spread popularity in simulation of the solidification process [9- 
14]. Rapid development of computational techniques with easy access to the computa- 
tion tools favoured even more progress in this field. 

Up-to-date publications about solidification modelling with the use of the cellular 
automation [6-13] do not foresee that the velocity of the phase boundary migration 
may depend on the value of local undercooling. This possibility was taken into con- 
sideration in [15, 16], but only for the front of the primary dendrite arms. 



2 Diffusion-Kinetic Model of Dendritic Growth 

2.1 Classification of Cells and Capturing Rules 

The present model extends further the classification of cells and introduces own set of 
capturing rules which forecast how the cells of liquid will be absorbed by the growing 
grains of solid phase. The classification bases on a condition of the nearest neigh- 
bouring cells (N, W, S and E in Figure 1) and cells along the diagonal (N-W, N-E, S- 
E and S-W). The applied rules of classification of the interface cells are compared in 
Table 1, while some examples of the practical use of these rules are explained in Fig- 
ure 2. Like in other available models, the division of cells into liquid cells (type Liq), 
solid cells (type Sol), and interface cells (type Int) was preserved. 



Table 1. Classification of interface cells 



Type of interface cell 
and its description 


Number of neighbouring cells in solid state 


neighbouring 


along diagonal 


Int-1 


Nucleus 


0 


0 


Int-2 


Planar boundary 


1 


- 


Int- 3 


Corner 


0 


1-3 


Int-4 


Angle 


2 (side by side) 


- 


Int-5 


Narrow passage 


2 (facing each other) 


- 


Int- 6 


Passage bottom 


3 


- 


Int- 7 


Closed cavity 


4 


- 



The following set of rules is proposed to describe changes in the type of cells and 
capturing of the liquid cells by interface during solidification: 

Rule 1. At the instant when in a nucleus cell of type Int-1 a content of the solid phase 
f= 1 appears, the cell changes its type to Sol. The neighbouring cells from its nearest 
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environment (N, S, W and E) still in the liquid state (type Liq) are captured by the 
growing grain, thus creating type Int-2 with all the attributes of a grain of this type (a 
number, orientation angle of crystal lattice, and phase type in the case of multiphase 
growth). 

The neighbouring cells along the diagonal (N-W, N-E, S-W and S-E) still in liquid 
state (type Liq) are also captured by the growing grain, acquiring all its attributes but 
changing their type to Int-3. 

Rule 2. At the instant when a cell acquires type Int-2 (planar boundary) with the solid 
phase content f= 1, its type is changed to Sol. The neighbouring cells of type Liq 
present in the nearest vicinity are absorbed by the growing grain and acquire type 
Int-2 with all the attributes of a grain of this type. 

The neighbouring cells of the type Int-3, belonging to this grain, change their type 
into Int-2. The neighbouring cells of the type Liq along the diagonal are also captured 
by the growing grain, acquire all its attributes, but their type changes to Int-3. The at- 
tributes of none of the 8 closest cells will change, if the cells have already been cap- 
tured by other grains. 

Rule 3. Rule 2 also applies to cells of types Int-4, Int-5 and Int-6. 

Rule 4. Rule 2 is invalid in the case of cells of type Int-3. 

Rule 5. At the instant when in a cell of the type Int-7 the content of solid phase f = 1 
is obtained, the type of this cell is changed to Sol. 

Rule 6. Other changes in the type of cells result from the count of the neighbouring 
cells of the type Sol according to Table 1. 



N-W 


N 


N-E 


W 


i 


E 


S-W 


s 


S-E 



Fig. 1. Schematic diagram of classification of the neighbouring cells by cellular automaton 



The developed classification and the set of rules describing transformation of the 
cell type in cellular automaton enable during simulation of solidification minimalising 
an effect of the computation grid structure with cells of an appropriate size on a pref- 
erential orientation of the dendrite arm growth forced by the grid symmetry. 



2.2 Heat and Mass Transport 

In the presented model, a coupled solution of the non-linear differential equations of 
diffusion was made on a common mesh of finite differences with square cells and a 
common time step. 
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Fig. 2. Examples of practical application of the schematic diagram of interface cells classifica- 
tion : a) planar boundary, b) corner, c) angle, d) narrow passage, e) passage bottom (bold line - 
solid phase/interface cells boundary, double line - liquid phase/interface cells boundary; broken 
line - hypothetical true position of the growing grain boundary; S - solid cell (type Sol), L - 
liquid cell (type Liq), I - interface cell (type Int); numbers -interface cell type according to Ta- 
ble 1) 



Since source terms were taken into account, as a solution of model equations an it- 
eration method was proposed, viz. a simple iteration based on the following equa- 
tions: 




1 + kr +k s T +kj +kj 



( 1 ) 
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c„, = (fee + fee + k~c: + k E c c E n )+ 

+ C,(1-(1 -k)f s ))Z 

where: T - temperature; k - coefficient of solute redistribution between the solid and 
liquid phase; C - true concentration of additional element in liquid (C L ), if the neigh- 
bouring cell is of type Liq, or concentration in liquid results from the coefficient of 
redistribution (C/k), if the neighbouring cell is of type Sol; L - heat of phase trans- 
formation; f s , f s - volume fraction of solid phase and its changes in the last time step, 
c v - specific heat volume, 

Z = l-(l-&)-(/ s + Af s ) + kc +k s c +kc +kc . 



In these equations the subscripts n and n+1 denote the iteration number, and super- 
scripts - addressing of the neighbouring cells (see: Fig. 1). The values of coefficients 
k c and k T are: 



, _ DAx t _ XAx 

k c — — y ’ % — - 



Ax 



c„Ax 



( 3 ) 



where: X - thermal conductivity, D -coefficient of diffusion. At- time step. Ax -cell 
size in cellular automaton. 

Similar equations of iteration were applied to the cells outside interface, assuming 
appropriate values of f s and A f s = 0. 

Due to great differences in the coefficient of diffusion of the solid and liquid phase, 
its effective value in the interface cells was determined as a weighted average. 



2.3 Direction and Curvature of Phase Boundary 

To determine the direction normal to a surface of the phase boundary, a well-known 
method based on the determination of vector components joining the centre of a bulk 
block of the neighbouring cells of the size 3x3 with the centre of a single cell was ap- 
plied. 

The methods of determination of the front curvature based on the cells count [17, 
1 8] cannot always ensure the satisfactory precision. In the case when higher precision 
was required, in the presented model an approximate computation of the curvature 
based on its geometrical definition was applied. As an approximate value of the inter- 
face curvature one can adopt a quotient of the difference in angles between the direc- 
tions of the solidification front in the neighbouring cells and the lengths of the front 
along these segments. In the proposed model, a central difference diagram depicted in 
Figure 3 was used. 

To determine a sign of the front curvature (convex or concave), it is enough to 
compare the sines of the angles of deviation of side BC and of one of the sides AC or 
AB in this triangle from the vertical or horizontal direction. The values of the sines of 
these angles are determined from simple geometrical relations, for example: 
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Fig. 3. Schematic representation of designations used for computation of the front curvature 



sin {ZCBG) = 



y{c)-y{B) 

V Me) - y{B)f + (x(B) + 1 + x(C)) 2 



(4) 



2.4 Migration Rate on Phase Boundary 

The velocity of crystal growth depends on the kinetic undercooling AT fl [19] 

u=\l-AT ]i ( 5 ) 

and is related with the front topography and crystallographic orientation (an anisot- 
ropy of the kinetic coefficient in crystals of metals characterised by face-centred lat- 
tice amounts to about 17% [20]). 

The kinetic undercooling is expressed by equation: 

AT [l =T Eq —T-AT K , (6) 

where T - temperature on the solidification front, T - equilibrium temperature for 
planar phase boundary with contribution of segregation, A T K — Gibbs undercooling 
(the effect of phase boundary curvature), which for the crystals of isotropic surface 
tension is determined from equation [2 1 ] 

y T K 

A T k =^-, (7) 

where: y - the Gibbs energy falling to an interface unit, T m - absolute equilibrium 
temperature for a planar surface, K - front curvature. 
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It can be assumed that the anisotropy of kinetic coefficient is described by relation- 
ship [22, 23] 

l-l(0) =lV( 1_5 n c os(wse)), (8) 

where: |i (l , 8 m - mean value and amplitude of changes in kinetic coefficient, m s - fold 
of crystal symmetry. 

Angle 9 determines here a deviation of the growth surface direction on a specific 
section from the selected base crystallographic orientation. 



2.5 Transformation Rate 

The time t o , required for the solidification front to cover the entire cell can be defined 
as a quotient of the cell diagonal projection on vector of the front migration velocity 
(dj and velocity modulus (Fig. 4) 



t = d lu . 

o u / 



( 9 ) 




Fig. 4. Schematic diagram to derive relationships between parameters of the solidification rate 
vector and the phase transformation rate in a cell of cellular automaton 

Applying properties of the vectors scalar product, an absolute value of the diagonal 
projection length on vector of the velocity can be defined as 



d x ‘ 




+ d y 


u y 


-\ 


ju. 


2 2 
;+»; 



( 10 ) 



A relative change in fraction of the transformation products during time interval At 
can be assumed equal to a ratio of this time interval and t o 
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4 f s 



uAx 

\r 



( 11 ) 



Having substituted to this equation a value of d u from equation (10), we obtain a 
formula determining the rate of transformation 



2 . 2 

«v +U y 



Ax d,.\uJ + d„u, 



( 12 ) 



If the mesh of cellular automaton has square cells, the formula can be transformed 
to a form similar to that proposed in [12] 



4 f s = 



uAt 



Ax(cos0 + sin©} 



(13) 



3 Results of Simulation 

The results of numerical computations consistent with the presented model are shown 
as an example in Figure 5, which also shows the growth of a dendritic grain inclined 
at an angle of about 20° to the axis of coordinates. The computations were made for 
parameters of a model material (a "succinonitrile - acetone" solution) according to 
[3], [24] on a mesh of cellular automaton with 800 x 800 cells (the length of a cell 
side was 0.05 pm). The value of the kinetic coefficient was accepted at a level of 
10' 3 m/(s-K) from 10% anisotropy; the initial undercooling was assumed to be 13.5 K. 

As results from 5, the disturbance of the modal value of 8 develops at the initial 
stage of nucleus growth. The primary dendrite arms are visible growing towards the 
maximum mobility of the solidification front. The “sea-weed”-type structure develops 
in the space between them. These arms grow towards the p(0) minimum value. They are 
marked with arrows on the picture 5 f and have a specific dense branching morphology 
(tip-splitting) shape [2]. The structure of a two-dimensional dendrite observed during 
the solidification of the model substance has similar characteristics (fig. 6). The dendrite 
arms of the dense branching morphology type are shown on fig. 6d by markers. The 
shape and positions of these arms are compliant with the modelling results. 



4 Conclusions 

The virtual structure of dendrites obtained with the help of the developed model is in 
terms of its quality consistent with the experimental structure of two-dimensional 
dendrites, formed during the dendritic solidification of "succinonitrile - acetone" so- 
lution in thin layers. The results of simulation obtained by means of the model pre- 
sented in this study are independent of an orientation of the axis of coordinates. 
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Fig. 5. An example of the two-dimensional dendritic growth - simulation 




Fig. 6. Change in the structure of two-dimensional dendrite during its solidification from model 
material ("succinonitrile - acetone") 
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Abstract. We study the modified bootstrap model as an example of 
a cellular automaton that is dominated by the physics of rare events. 
For this reason, the characteristic laws are quite different from the sim- 
ple power laws often found in well known critical phenomena. Those 
laws are quite delicate, and are driven by co-operative events at diverg- 
ing length and time scales. To deal with such problems we introduce a 
new importance-sampling procedure in simulation, based on rare events 
around “holes” . This enables us to access bootstrap lengths beyond those 
previously studied. By studying the paths or processes that lead to emp- 
tying of the lattice we are able to develop systematic corrections to the 
theory, and compare them to simulations. Thereby, for the first time 
in the literature, it is possible to obtain credible comparisons between 
theory and simulation in the accessible density range. 



Complex dynamical processes abound in nature, and their understanding 
represents one of the great challenges that scientists face as they try to advance 
our knowledge of the natural universe. The field of ‘complexity’ in the physical 
sciences takes a particular view of this endeavor that has been very successful 
in many arenas of study. In this approach, understanding complex problems is, 
unlike many areas of physics and chemistry where detail is everything, based 
on the simplest possible manifestation of the problem. Thus, by stripping away 
the details that are not responsible for the primary observation, one obtains a 
simple model that contains the essential features of the problem to be solved. 

This approach is sound only when we can be relatively certain that there 
are striking regularities, or generalities to be learned about the system, for we 
have learned by our training and experience that such phenomena are rarely 
determined by a multiplicity of detail. The development of simple models is at 
its most successful when the phenomena originate in a long length and time scale, 
for then we have learned, as in problems such as critical phenomena [1], that 
many of the observations may also be reproduced in a numerically quantitative 
manner. 

The ideas to be discussed in this paper grow from many studies over several 
decades of assemblies of highly coupled units, each which has a state (for example 
‘movable’ and ‘immovable’ particles or ‘on’ and ‘off’ processors), that depends on 
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those of its close neighbors. Thus, the initial state of the system may be highly 
constrained, with very few opportunities for overall movement. Significant groups 
of units may require a coherent sequential switching of units, perhaps extending 
over very large length-scales in the system. This type of problem may also imply 
an activation barrier, but here we emphasize that the barrier crossing requires 
movement of many units. Thus long times scales may emerge from not just the 
intrinsic barrier, but the large number of coherent units required to move along 
the ‘reaction co-ordinate’ or direction of ‘easy’ motion. 

The need to work with simplified dynamical models is clear, for often they 
represent great simplifications of the more realistic dynamical systems that have 
proven to be difficult to solve by other means. Frequently massive computational 
effort is directed at such problem, only to find that the key phenomena are still 
beyond the length and time scales of the physical problem. Those who study such 
problems as protein folding [2] and the dynamical arrest [3] or glass transition 
[4] are well aware of the limitations of the direct computational approach. In 
this paper we will focus on dynamically constrained lattice models [5,6,7,8,9,10], 
special classes of cellular automat that are proving very important in studying 
such questions of ‘slowed’ dynamics. [11,12]. There has been something of a 
mystery surrounding this class of cellular automata [13] that has lead some to 
question the value of computational studies at all because of the long history 
of disagreements between theory and simulations. We are now in a position to 
clarify this issue, and that is the main purpose of the present paper. 

The bootstrap percolation [14,15,16,17,18,19,20,21] problem has attracted 
considerable interest from a variety of scientific communities. In essence it (or 
its obvious variants) is a method to analyze the dynamics of such a system of 
highly coupled units, each of which has a state that depends on those of its close 
neighbors. In the past such units have been considered particles, processors [22], 
or elements of a growing population. Units that become active in the underlying 
dynamics are simply removed in the bootstrap. The ensemble can undergo a 
transition to an “arrested” state in which all dynamics is quenched, and this 
is (after bootstrap removal of units) reflected in a transition from an empty to 
partially filled lattice. This arrest is found to be driven by a long length scale. The 
change of state of one unit becomes increasingly difficult near arrest, requiring 
a long sequential string of favorable changes in its surrounding units. Thereby 
a long length (the size of the surrounding region to be changed) is slaved to a 
long (relaxational) time scale. 

In the well-known conventional “bootstrap model” (in the context of parti- 
cles) [15,16,23], particles are removed if they are surrounded by c or less neigh- 
bors. We study in more detail the modified bootstrap [17,20,24]. In this particles 
are removed if any two of its vacant nearest neighbors are also second neighbors 
to each other. The (random bootstrap) transition occurs as a function of c (or 
the underlying local rule), system size L , and initial particle density p and the 
transition point is defined by the condition that half of the prepared initial states 
are empty after removal of all movable particles. 
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The bootstrap-type problems mentioned above fall into two broad “univer- 
sality” classes of arrest transition [16]. The first (not studied here) a continuous 
(“critical” point) transition in which progressively more particles lose motion, 
leading to growing arrested domains whose typical size diverges with a power 
law. 

The second type of transition (of interest to us here) is more reminiscent of a 
first-order transition, and the theory is more subtle. There, dynamically slowed 
domains grow near arrest according to an essential singularity. Mobilization of 
these domains is dependent on rare events (we call these “connected holes” [11, 
25]) involving specific units that can “nucleate” motion on this large length. As 
will become clear, these “nuclei” become rare (highly dilute) near the transition, 
the typical distance between them being the (diverging) bootstrap length, while 
the time taken for communication between them, since it involves a diverging 
number of intermediate units, also diverges. 

For such transitions the following conclusions have been drawn by the com- 
munity. For c = d (the dimension) it is believed that the bootstrap length £ 
diverges according to an essential singularity £ = exp°^ _1 ^(— A/(l — p)) where 
expoM-i) i s the exponential function iterated d — 1 times [16,26,27]. For the 
two dimensional square lattice c = 2 and the modified model, theoretical calcu- 
lations [28] have resulted in an elegant outcome; essentially what are believed 
to be exact results, lim „_n 2(1 — p)log£ = A , where A = 7t 2 /9 and 7t 2 /3 for 

£— >oo 

conventional and modified bootstrap respectively. 

Attempts to obtain this asymptotic result by simulation have so far failed, 
including some of the most extensive calculations to have been applied in sta- 
tistical mechanics (up to L = 128,000). It has been speculated that theory and 
simulation might agree for system sizes that are far beyond what will ever be 
possible, or indeed of interest, for Physics to explore [29,28,13]. We will show 
that this need not be the case. Though the singular behaviour implied by such 
models is subtle, it can be studied by both theory and simulation. 

We will develop a new approach to the theory, and the simulation of the 
cellular automata under question. 

Thus, we identify “holes” on the lattice as spaces (vacancies) into which par- 
ticles can move [11]. We then identify these holes as either “connected” or caged 
(disconnected) according to whether the lattice can (or cannot) be vacated by 
sequentially removing particles beginning from that hole. The relationship to 
conventional (random) bootstrap simulations (described above) is clear; a given 
system size and density must contain at least one connected hole for it to be 
vacated by random bootstrapping processes. Thus, the bootstrap correlation 
length £ is related to the connected hole density v via v = l/£ 2 . The bootstrap 
length is therefore the average distance between connected holes (these repre- 
senting the growth “nuclei” alluded to in our introductory remarks) that become 
increasingly rare near arrest. The device of holes allows us to focus on the key 
“order” parameter, rather than the very populous, but irrelevant particles and 
vacancies [11]. Here we present some results for d = 2. 
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Fig. 1. Modified Bootstrap Model. The (o) points represent the total hole density (the 
dotted line is a guide to the eye). The symmetrically growing squares (□) are compared 
with Holroyd’s [28] asymptotic result exp(— 7r 2 /3(l — p)) (lower solid line). Also shown 
are results for diffusing squares (A) and small-asymmetry rectangles (V)- in these cases, 
the lines through the points represent the theoretical result. 



In the simulations we begin by creating a hole with the appropriate weight, 
and then populate (‘grow’) the configuration with particles and vacancies around 
this site, checking at each stage to see if that hole is connected, or trapped [11] 
by identifying cages at that length. Since the typical cage size grows much more 
slowly (log(l — p)/log(p)) than the bootstrap length, we need check only rela- 
tively small distances to determine if the hole is connected. This is the imporant 
difference between studying the simulation model in the usual complete manner. 
Since the imporant events are so rare, we study only parts of the automata that 
have them, and then combine the results in an appropriate manner. 

The outcome is that the results produced here require only a few hours 
of time on a personal computer. In Figure 1 and 2 we show results for the 
total connected hole density in the modified and conventional bootstrap models 
respectively and these agree, where comparisons are available, with the most 
extensive conventional simulations. For example, the (o) points in the uppermost 
curve of Figure 2 represent the hole density implied by the results in [29,30] 
(system size L = 128,000), while the (o) points on that same curve are from 
our importance-sampling procedure discussed above (we have simulated up to 
an effective size of L = 200, 000). 
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Fig. 2. Bootstrap Model. The (o) points represent the total hole density (the dotted 
line is a guide to the eye) and can be compared with the results for l/£ 2 ( o ) for 
£ > 100 [29,30]. The symmetrically growing squares (□) are compared with Holroyd’s 
[28] asymptotic result v = exp(— 7r 2 /9(l — p)) (lower solid line). Also shown are results 
for diffusing squares (A) and small-asymmetry rectangles (V). 



Theoretical calculations [7,24,28] have approximated the process of simulta- 
neous removal of particles on increasingly large square contours until reaching 
one that is entirely occupied by particles, and therefore immovable. The hole 
that originated this process is then termed disconnected. Thus schematically 
the probability of bootstrap is represented 77^. 1 (1 — p ak ) b , where 6 represents 
the number of sides and a the increment on the lattice. In the high density 
limit one takes the natural logarithm of the product, approximates the sum 
with an integral and makes the substitution y = p k . For p — > 1 this leads to 
— b/(alnp) fgidy/y) ln(l — y) ~ — &7r 2 /6a(l — p) [31]. In the modified bootstrap 
model, a = 2 and 6 = 4, one obtains — 7 t 2 /3(1 — p), and it is this result that 
has been shown to be asymptotically exact [28]. From Figure 1 it is clear that 
there is no agreement between the simulated results for the total hole density 
(o) and the asymptotic result, even for the highest densities we can simulate. 
We would like to begin by identifying how much of this disagreement is due to 
the primitive nature of the approximation, and how much is is due to the fact 
that we are working with an asymptotic result. 

To begin with we calculate exactly the probability for removal of concentric 
squares of particles, a result that includes all corner contributions, and is valid 
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for all densities. Then, 

p^ = {i-p)n^ s \ P ) (i) 

4 cs) (p) = 1 - 4 p 2k+1 + 2p ik+1 (2 + p) - 4 p 6k+1 + p sk 

Using (1 — p 2k+1 ) 4 as lower bounds for the coefficients c£ s \p) we find a modified 
asymptotic result for the hole density (1 — p )~ 5 exp(— A/(l — p)) [31]. 

Now we are also able to simulate the same process of simultaneous removal 
of squares, as a check on the analytical formulae. We find that the modified 
asymptotic result is almost equal to our numerical solution of Equation 1 and 
simulation results for the symmetrically growing squares process in the density 
range of interest. While these are in perfect agreement with each other (see 
(□) in Figure 1), they are still many orders of magnitude different from the 
full simulated connected hole density, although there is clearly a considerable 
improvement over the purely asymptotic result exp(— 7r 2 /3(l — p)). 

These results indicate that part of the lack of agreement between asymptotic 
hole density and the simulations is that the asymptotic formulae are simply not 
appropriate for the density ranges that are accessible. However it also suggests 
that in the density ranges of interest to physics the theory may also be limited 
because we have not included enough paths or independent processes of particle 
removal. Therefore our aim now in development of the theory is to systematically 
enlarge the possible paths in the calculation, until we approach the full simulated 
result, at each stage validating the theoretical calculation by simulating the same 
restricted set of paths. 

The problem is simply appreciated. Thus, if we attempt to remove all of the 
edges of the boundary simultaneously the process will fail if even one of the 
edges does not contain the appropriate vacancy. Now we can somewhat improve 
the situation by simultaneous removal of two adjacent edges causing the void 
to move in the direction of that diagonal. By continuing this process randomly, 
changing to other pairs of adjacent edges when a given direction is blocked, the 
expanding void diffuses. This diffusive process therefore confers some improved 
capacity for the expanding void to avoid being blocked by a continuous line 
of particles. However, if one included a full set of particle removal paths the 
voids would be free not just to diffuse but also elongate. This would imply that 
a blocked edge also elongates, eventually including another vacancy, at which 
point we are free to remove that edge. 

We thus identify two types of movements of the expanding void around a hole 
that should be taken into account; diffusive motion and elongational fluctuations. 
We can, again by simulation, estimate the relative importance of these. In Figure 
1 we show (A) the increase in the connected hole density implied if we allow 
only this diffusive motion. In the full simulated curve (o) we admit all diffusive 
paths, and also all types of elongation, and the difference between the results 
is therefore due to these shape fluctuations alone. However we may also check 
the nature of the elongational fluctuations that are required to obtain the full 
answer. The answer is surprising for we have found that, to obtain the full 
simulated answer we need never permit paths where the voids exceed an axial 
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ratio of more than 1.4. In fact by far the largest single contribution comes from 
near-square processes, providing they are permitted to diffuse. Our overall aim of 
theory is therefore to implement firstly, the diffusive square process, and secondly 
some degree of elongational fluctuation. 




Fig. 3. Intermediate configurations arising in evaluation of the sum over ‘paths’ that 
may be used to empty the lattice. The inner square from each of these figures is assumed 
to have been emptied of particles in the previous step, and the next parts to be emptied 
lie between this and the outer bounding square. Shaded outer regions imply a complete 
line of particles, blocking further movement in that direction. Unshaded outer regions 
have at least one vacancy in that part of the perimeter. The arrows on the squares 
indicate possible transition involving the growth of two adjacent boundary lines by one 
step. This process restores that local configuration to one of the intermediate states 
again, with the internal empty square one step larger in two directions. The process 
terminates when the local configuration makes a transition outside of the class P 1 - 
P 3 (diffusing squares) or P 1 - P 5 for small-asymmetry rectangles. Note that in P 3 we 
have given an explicit example of the extension of the boundary implied by the arrow. 



We have been able to realize these aims. 

We define intermediate states of the system as the squares illustrated in 
Figure 3, with weight . Paths implied by removal of particles map the growing 
vacant region between only these states, larger by one step at each stage. If we 
consider only the limited set P 1 - P 3 , such processes correspond to growing 
and diffusing squares. Inclusion of the states P 4 and P 5 permits in addition 
“fluctuations” of the square by one additional layer. These local intermediate 
states are related by the coupled equations, 

f(p) = E c i ,Jl w p l-iW ( 2 ) 

j 

where i, j’s range from 1 — > n, with n = 3 in the diffusing squares process, and 
n = 5 in its extended small-asymmetric rectangular version, (p) defines the 
probability of migration from class j to i at the fcth step. These equations are 
solved subject to the initial conditions, Pf ,) = (1 — p)Sij. 

The choice of these states, and transitions between them, is far from trivial 
since we must ensure an “ordering” of the removal process if we wish to use a 




372 P. De Gregorio et al. 



random measure for the particles in calculating the coefficients c^' 2 \p). For the 
sake of simplicity, we here present only the coefficients of the process involving 
diffusing squares, 

4 M) = 1 - 2p 2k + p 2 *- 1 

4 1.2) = (i-p)(i-/) 

4 1.3) = (i-p) 2 . 4 2,1) = 2 p k ( 1 - p k ~ l ) 

ci 2 ’ 2) = l-2/ + p 2fe - 2 

4 2.3) = 2p(l ~ p — p k ~ l + p k ) 

4 3,1) = p 2fc_ \ 4 3 ’ 2) = /d-/- 1 ) 

4 3 ’ 3 ) = p 2 (l + 2 p k ~ 2 - 4 p k ~ x + p 2k ~ 3 ) 



The total bootstrap probability (p) may be calculated numerically for 
any density, limited only by the precision of the computer. The same processes 
may be simulated on the computer and are in each case identical with the theory. 
Results for all are given in Figure 1 and for modified bootstrap. 

Diffusing squares (A), improves the comparison between theory and full sim- 
ulation, and small-asymmetry rectangles (V) yields results that may (for the first 
time) begin to be credibly compared to computer simulation of the full simulated 
hole density. 

We may now summarize our results both in relation to the bootstrap problem, 
and in a more broad context. 

The first point is that cellular automata that describe rare events, such as the 
bootstrap percolation model, and its ‘connected holes’ may be best simulated 
by the type of importance sampling we have proposed. That is, rather than 
simply applying periodic boundary conditions in a square or cubic array we 
might instead ’grow’ the configurations of the automaton around the rare events. 
In our case this was simple in that in the bootstrap we seek only to identify 
whether a hole is connected or not, and this is possible by studying successively 
larger environments around the hole. This is, to our knowledge, a new way of 
approaching the simulations of such models, and is a theme that might be worth 
exploring in more detail for this model, and for other examples. 

Our second conclusion is that these rare events (the connected holes) confer 
a rather delicate singular behavior on the correlation length that is not easily 
dealt with in theory, except in the truly asymptotic limit. It transpires that the 
system enters this asymptotic limit at such large length scales that it is difficult 
for simulations to reach them. However, it also transpires that the theory is 
extendable so that it is a reasonable approximation across a wider range of 
density, hole density, or bootstrap length. Thereby comparisons between theory 
and simulations become feasible, indeed systematically improvable as desired. 

This is an important point, for these improved theoretical approaches may 
be shown to share the same leading asymptotic form that has been proposed in 
the past. However, that leading asymptotic behavior is strongly modified at all 
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but the closest distances to the transition. There is therefore no inconsistency 
between the theory and simulations, in the density regime where both can be 
studied. Equally the original theoretical predictions of asymptotic behavior are 
correct. It seems clear, however, that the length scales at which that leading 
asymptotic behavior becomes relevant is simply too large to be of interest to 
simulators, or indeed, to physicists. 
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Abstract. Based on the Relaxation Element Method, the propagation of zones 
of localized plastic deformation in a polycrystal under loading has been simu- 
lated within the framework of continuum mechanics. The model can be referred 
to the class of geometrical models, known as cellular automata. Plastic defor- 
mation is considered to occur under the action of differently scaled stress con- 
centrators. A plastically deformed grain acts as a stress concentrator at the 
mesoscale level. The involvement of a cell into plastic deformation was deter- 
mined by the value of the critical shear stress in the cell center along the slip 
system. This approach allows to analyze the number and interaction of slip 
systems and accounts for work-hardening on the patterns of the propagation of 
the bands of localized plastic deformation in aluminum polycrystals. 



1 Introduction 

Strain localization is one of the most important possible material instability phenom- 
ena in solids. In order to describe this phenomenon, many authors [1, 2] introduce 
gradients either in space or in time into the constitutive equations, that result in the 
necessity of using a high resolution numerical technique where the accuracy of the 
solution depends on the number of calculational cells. That is why analytical solutions 
to resolve such kind of problems are more attractive. It is necessary, therefore, to 
develop new-basically analytical- calculation methods for determining the zones of 
localized plastic deformation with gradients. The relaxation element method, initially 
proposed by Ye. Ye. Deryugin [5] is most suitable to handle the above problem. The 
necessity of the development of a new modeling method was caused by experimental 
facts on mechanisms of plastic deformation, having been obtained in the last 10-15 
years. 

According to experimental investigations by Neuhauser [3,4], obtained with a 
videorecorder on the sample surface, the plastic deformation under active loading 
proceeds at all detectable time scales (from microseconds up to minutes) in form of 
instabilities, which only from the macroscopic point of view can be considered in 
terms of a smooth curve of loading. Dislocation theory gives an analogous conclu- 
sion: as soon as a dislocation configuration looses its stability due to an increase in 
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applied stress, the further development of the shear zone is a rapid dynamic process. 
Once operated, a shear zone remains during further deformation, as a rule, «frozen». 
Traditional approaches of continuum mechanics, being justified on the macrolevel, 
should therefore be completed by other methods when attempting to describe the 
processes on the level of a separate shear zone. The basis of all approaches of classi- 
cal mechanics is the assumption about the existence of a local law of plastic flow in 
one or in another form, i.e. an unambiguous connection between stress and strain (and 
in generalized theories [5] also the spatial and temporal derivatives of strain). Data on 
the development of plastic deformation, in form of instabilities in local regions, lead 
to the conclusion that on the level of a separate shear zone the law of plastic flow in 
the above sense can not be formulated. Using the relaxation element method [5-10], 
plastic deformation can be analyzed on such a mesoscopic level. 



2 Physical Principles and Algorithm of the Model 

The analysis of many experimental data shows, that in real materials the process of 
plastic deformation occurs not simultaneously over the whole volume of material, but 
includes separate structural elements. Then a natural reason of grain involvement into 
this process is the presence of stress concentrators of a definite scale [11], The stress 
field in the vicinity of a triple grain junction under tensile loading has been calculated 
by the relaxation element method [12], when one of the grains undergoes plastic 
deformation. It was shown, that at high gradients of plastic deformation in front of the 
boundary of this grain, the corresponding stress concentration can exceed the exernal 
applied stress by one order of magnitude. However, the influence of such a 
mesoconcentrator is limited over a very short distance, i.e. only several dozens of in- 
teratomic distances. Nucleation of only several dislocations can cause essential rela- 
xation of these microstresses. This means that microconcentrators like dislocations 
cannot provide a sufficient value and rate of plastic deformation in the neighbouring 
grain to transfer further rapid propagation of plastic deformation from grain to grain. 
Hence, an additional concentrator is required, having the region of elevated stress 
concentration, embracing the region, being commensurable with the dimension of 
crystallite (mesoconcentrator). A plastically deformed grain besides the peak stresses 
at the junction creates an elevated stress concentration, embracing the region, 
commen-surable with grain dimension. The plastically deformed grain acts as a stress 
concen-trator on the mesoscale level. When simulating on the mesolevel, the 
plastically deformed grains are represented as a circular site of plastic deformation in 
a plane isotropic polycrystal, we assume that plastic deformation in the site occurs 
uniform. To explain the essence of the relaxation element method, consider the 
following situ-ation: A stress-strained state of a plane with the above-mentioned 
plastic deformation site with specified boundary conditions on the circular contour. 
Fig. 1 shows the loading of such a plane schematically. The tensile stress o is directed 
along the y-axis. We restrict our attention in what follows to small strains (0,0002%) 
and the material is characterized in terms of linear elastic constituent relation. 
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Fig. 1. Schematic representation of the boundary conditions for the plane under loading with a 
circular site of localized plastic deformation 



Taking into account the experimentally discovered fact that plastic shear is 
accompanied by a stress drop in the local volume of solid [11], let us assume that 
under plastic deformation, the uniform stress field is decreased by Aa. Then the stress 
field inside the site will be described by g-Ag. According to the superposition law of 
linear elasticity, the general solution to a boundary problem can be represented by a 
number of simpler solutions, provided that the resulting boundary conditions shown 
in Fig. 1 remain the same. This can be simplified as follows: the uniform stress field 
G-A g (Fig. lb) is singled out from the general solution. What remains is the solution 
for a plane under an external load Aa (Fig. lc), if the stresses are zero only in the 
local region occupied by the plastic deformation site. De Wit showed [13], that 
absence or reduction of stresses in the given region is the result of plastic 
deformation. In our case this comparative plastic deformation ensures a displacement 
of the circular contour under the external stress Ao, when there is no stress inside the 
contour. The boundary conditions, according to Fig. 1, are those of a flat testpiece 
with a circular opening subjected to the uniaxial Ag. 

According to the Kirsh’s solution [14], the stress field outside a circular contour in 
the Cartesian coordinates is characterized by the following components: [14,15]: 
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where R - is the grain radius, r 1 = x 2 +y 2 - is the distance from the center of inclusion 
to the point with the coordinates (x, y); F = Sy\3a 2 +2y 2 )/r, G = 24 R 2 y 4 /r. 

The displacements of the arbitrary point (x 0 , y a ) on the contour are defined by the relation 



m^ 0 >To)=3t 0 Ao-/E, u x (x 0 ,y 0 )=-x 0 Ao/E , (3) 

where E - is Young’s modulus. The boundary conditions (3) are satisfied by a 
homogeneous field of plastic deformation with the components 



e x = -Acr/E, e v =3Acr/E, e xy = 0 



(4) 
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Fig. 2. Spatial distribution of the Act component of the internal stress field of the circular 
relaxation element. (External applied stress cris directed along the y-axis). 



From the performed analysis it follows, that the site of plastic deformation can be 
considered as a defect of a definite scale with its own field of internal stresses, set by 
equations (2). Such a defect is called a relaxation element (RE). Shown in Fig. 2 is an 
example of the component Act distribution (2) of the given circular relaxation ele- 
ment. It is seen, that there exists a stress peak on the boundary of the RE. The region 
of elevated stress is comparable with the dimension of the relaxation element. At a 
distance not less than 1,3 a from the center of the RE the perturbation of stresses in 
this region affects greatly the development of plastic deformation in the adjacent 
grain. Once the grain with the maximum value of shear stress is known, the external 
stress field needs to be determined under which the resolved shear stress is attained. 
The calculation mesh was schematically presented by a total of 1250 (50x25) cells 
shaped as hexagons, playing the role of grains in polycrystal. In each of the grains the 
slip system orientation with respect to the axis tension was set using random number 
generator. The second slip system was added at an angle of 60° with respect to the 
previous one. The involvement of a cell into plastic deformation was determined by 
the value of T cr , a critical shear stress in the cell center along the slip system. Plastic 
deformation initiates on the slip plane of those crystallites, in which the resolved 
shear stress reaches a critical value along the favorably oriented slip direction. The 
resolved shear stress was calculated with the following formula [14]: 

r=(a]-<7)sinacosa+(7 v (cos 2 a-sin"a). (5) 

The action of the element transition from the elastic into the plastic deformed state 
was defined by placing of an RE of round shape inscribed into the hexagon in its 
center. It changes the stress-strain state of the polycrystal aggregate as a whole and 
the region of operation of maximum shear stress. The act of grain involvement into 
plastic deformation is accompanied by a stress drop within the local region (in our 
case within the circular region) and the stress perturbation beyond this region. The 
problem now consists in designing a cycle that can, according to certain criterion, 
determine the sequence of grain involvement into plastic deformation after the first 
RE has appeared. 

2.1 Calculational Procedure 

In what follows we represent the process of plastic deformation as a sequential chain 
of elementary acts of stress relaxation in the different crystallites. Suppose the sped- 
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men is subjected to tensile loading along the y-axis and the external applied stress is 
equal to o ext = a v . The external applied stress creates a shear stress in the crystallites: 

T v =<r„sina t cos a, (6) 

where the indices i and j unambiguously define the x,y coordinates of the given crys- 
tallite. Having determined the coordinates of the crystallite in the slip system in which 
the shear stress is maximum, the external applied stress in the formulae is obtained as: 

<7 ext = r/sina v cosay (7) 

According to the condition that the critical shear stress equals T :r = 50, we obtain a m 
=50/sina ij cosa, With known external applied stress and given coordinates of the 
crystallite, in which the maximum shear stress is attained, a relaxation element is 
placed in this crystallite. The relaxation element creates a field of stresses , the 
compo-nents of which are represented by equations 2. Instead of Ao we take 1 , Vi, 14, 
of the minimum external applied stress defined at the previous step. Such values of 
A a were chosen with taking into account the fact that the stress drop inside the region 
of relaxation should not exceed the external applied stress. Because the total shear 
stress in the centers of each crystallite is expressed by the formulae: 

T n =(J exi s ' na ij cosa tj + (<7 yij ~(J xjj ) sin a y cosa tj +d xyij (cos a tj - sin 2 a ,), (8) 

the external applied stress can be calculated in each crystallite is defined by the 
formula: 

cr exli j= ( 50-(d yij -a xij ) sina v cosay +a\ sij (cos 2 a ij - sin a it ))/sin a :j cosay (9) 

So we have an array of external applied stress in each crystallite under which the 
critical shear stress T cr = 50 MPa is attained in each crystallite. Apparently the lowest 
value of external applied stress matches the maximum shear stress. Physically that 
means that at the given step of calculation among all of the crystallites the only one 
will be involved into plastic deformation in which the maximum shear stress is at- 
tained and that is why the lowest external applied stress is necessary to bring it into 
action. 

We look for the coordinates of those crystallites with the lowest external applied 
stress. In such a way the coordinates of the crystallite and the external applied stress 
in this crystallite define the values, necessary for the beginning of a second cycle of 
calculations. In the present case the equation (9) can be written in the following form: 

y, 50 - (£ clij - £ alij ) sin a v cos oc, y - X d v y cos 2a, y 

U ext i j = U U U ’ V 1 V -V 

sin OLj cos a y 

where the sign of sum means the summation of the components of the stress tensor 
from RE at each step of the calculation. In the given example calculations of plastic 
deformation localization are performed only for one slip system in each crystallite. 

Consider the presence of two slip systems in each crystallite. The second slip sys- 
tem is oriented at an angle of 60° with respect to the first slip system. The orientation 
of the first slip system is put at random in each crystallite. The minimum external 
applied stress in each of the slip systems is defined separately and the minimum one 
is selected. In this way, the external applied stress and the coordinates of the crystal- 
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lite with maximum shear stress (and minimum external applied stress) are known. 
Now the next cycle of calculations can be performed. 

Let us next introduce the coefficient of work-hardening. For each crystallite let us 
introduce the coefficient K.., which is equal to 0 when the crystallite ij is not involved 
into plastic deformation and each time it is increased by 1 when the crystallite is in- 
volved into plastic deformation, so that K= K..+ 1. Let the resolved shear stress on the 
slip system in each crystallite follow the formulae: x„ = 50( 1 +KJ, where K t/ is equal to 
zero for all crystallites not involved into plastic deformation. Now (10) reads 

50(1 + ^)-(Io^-i;Oxy)sina j , cos - 1 cos 2a,, 

U ext ij= V ij V . ' 1 1 > 

sin a (> cos a,, 

There are two possibilities in accounting for work hardening in each crystallite. First, 
the acting of one slip system doesn’t depend on the acting of another one and the 
crystallite, once involved into plastic deformation according to achieving the critical 
shear stress in one slip system, can be involved into plastic deformation due to 
achieving a critical shear stress in another slip system. In other words, we set the two 
coefficients of work-hardening K I and A", for each of the two slip systems in each 
crystallite. Following the above-mentioned procedure we separate the calculation of 
minimum external applied stress in each of the two slip systems with separate arrays 
of K. for each of them, i. e. K u . for the first slip system and K 2r - for the second one. 
And secondly, the repeated involvement of the same grain into plastic deformation 
depends on the involvement of the crystallite into plastic deformation on the first slip 
system, i.e. we have only one coefficient of work-hardening for two slip systems in 
each crystallite. We consider two cases of work-hardening: 1) with interaction of two 
slip systems and 2) without interaction between them. Interaction between two slip 
systems means, that when one of the grains has been involved into plastic deforma- 
tion when achieving the critical resolved shear stress in one slip system, it cannot be 
involved again because a critical shear stress in another slip system needs to be 
achieved. Hence we put K lr> =0 before selecting the grain with minimum external 
applied stress in the second slip system. Along with accounting of linear dependence 
of work-hardening on the number of grains involved into plastic deformation an at- 
tempt to take work-hardening into account with the help of the power law expression 
for the stress-strain relationship a = Ke" pr where K was chosen, e.g., equal to 250 and 

n=0.2. 



3 Results and Discussion 

Using the above approach, the simulation of propagation of the sites of localized 
plastic deformation has been performed with accounting for work-hardening. The 
patterns of propagation of the bands of localized plastic deformation with work- 
hardening and the presence of only one slip system in each crystallite are shown in 
Fig. 3. The coefficient of work hardening was taken to be K= 4. As can be seen from 
the picture the bands quasi-homogeneously fill all the cross-section of the specimen 
with preferred (60°) direction of their propagation. The loading curve has a clearly 
pronounced increasing character. Because of the presence of only one slip system in 
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each of the crystallites the repeated involvement of the same grain requires an in- 
crease in the external applied stress K times higher than the critical shear stress and 
that practically exclude the repeated involvement of the same grains into plastic de- 
formation. More and more grains are involved into plastic deformation, which gives 
he pattern a scattered appearance: Localization is even observed at the first steps of 
simulation. 




Fig. 3. The evolution of the sites of localized plastic deformation in a polycrystal under tensile 
loading with one slip systems in each crystallite and with accounting for strain hardening (the 
value of relaxation is equal to 0.5 of external applied stress). 




Fig. 4. The evolution of the sites of localized plastic deformation in a polycrystal under tensile 
loading with two slip systems in each crystallite and with accounting for strain hardening (the 
value of relaxation is equal to 0.5 of external applied stress), the slip systems in each crystallite 
are dependent on each other. 
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Fig. 5. Patterns of propagation and loading diagram of hardening polycrystal with two slip sys- 
tem in each crystallite with strain hardening coefficient K= 4, the value of relaxation is 0.5 from 
the external applied stress, the slip systems in each crystallite do not depend on each other. 



In previous papers [5-9], strain-hardening was accounted for only by prohibition of 
the repeated involvement of grains into the process of plastic deformation. Such an 
approach resulted in the limited case of work-hardening. Here we account for work- 
hardening by introducing the coefficient of work-hardening, when calculating the 
external applied stress. That means that each repeated involvement of the same crys- 
tallite into plastic deformation is possible only when achieving the critical shear stress 
(50 MPa in our case) multiplied by the coefficient K. The critical shear stress is equal 
to 50 MPa. Setting the K being equal to 4 mean, that after the grain is involved into 
plastic deformation for the first time, at 50 MPa, its repeated involvement is possible 
only at T = 200 MPa. That gives the possibility to other grains, which have not been 
involved yet into plastic deformation, to be involved, as the resolved shear stress 
attains the critical value. 

Another pattern of the propagation of the bands of localized plastic deformation is 
obtained by accounting for the presence of two slip systems in each crystallite, Fig. 4. 
At this stage the sites of localized plastic deformation become more localized. They 
represent systems of intersecting bands, propagating across the cross-section of the 
specimen. At the N=50 th step of grains involvement into plastic deformation the two 
parallel bands appear. Then another band propagates from these initial ones, forming 
the system of intersecting bands. Fragmentation of the specimen by the band of lo- 
calized plastic deformation is clearly pronounced here. Starting approximately from 
step number 600 the involvement of new grains takes place, making impossible the 
repeated involvement of the same grain into plastic deformation, because in the given 
approach we set the increasing value of the external applied stress for all the grains, 
once involved into plastic deformation, independently on which system plastic slip 
occured. The loading diagram has increasing character but the average level of exter- 
nal applied stress goes somewhat lower, because two slip systems can act in the same 
crystallites. Shown in Fig. 5 is the case of the operation of two sliding systems in each 
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of the crystallites. As opposed to Fig. 4 here we take into consideration the slip sys- 
tems independently. This means that the same grain may be involved into plastic 
deformation according to the acting of two slip systems, first on one sliding system, 
then on another one, where involvement according to achieving the critical shear 
stress in one sliding system doesn’t depend on the acting of another one. The result- 
ing deformation pattern possesses a pronounced localized character. The loading 
diagram has increasing character, but the stress remains less than in the previous case, 
depicted in Fig. 4. The patterns of LPD band propagation, accounting of work- 
hardening with power law dependency are represented in Fig. 6. This is the case of 
operating of two independent slip systems in each crystallite and the value of relaxa- 
tion (stress drop) was taken to be equal to 0.2 of external applied stress. Because of 
the smaller magnitude of stress relaxation (0.2 instead of 05 of external applied 
stress), the amplitude of oscillations are smaller and it results in less pronounced 
localization. The curve increases significantly and at the N=801 (£ p = 0.0002%) it 
reaches the value 296 MPa. Fragmentation of the specimen by the LPD bands occurs 
here as in the case of accounting for linear work-hardening, depicted in Fig. 6. 




a, MPa t ci .=50 MPa 




Fig. 6. Loading diagram of hardening polycrystal with two slip systems in each crystallite with 
the power law hardening, the value of relaxation is 0.2 from the external applied stress, the 
slip systems in each crystallite are independent on each other. 



4 Conclusions 

The propagation of the LPD bands and influence of work-hardening on the patterns 
of development of the sites of plastic deformation has been analyzed within the 
framework of a meso-mechanical model of plastic strain localization. The model 
allows to simulate both non-hardening and hardening behavior of the polycrystal 
material under loading. Work-hardening was accounted for the case of interacting slip 
systems in each crystallite and not interacting ones. The comparison of the patterns of 
development of the sites of localized plastic deformation and loading curves for the 
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case of accounting of interaction of two slip systems and non accounting for ones 
shows, that in the latter case the pattern of plastic strain propagation has more pro- 
nounced localized character. The number of slip systems in each crystallite and the 
non-accounting of the interaction of the slip systems result in the enhancement of the 
localization, that is accompanied by a lower average level of external applied stress. 
The bands of localized plastic deformation proceed at an angle of 60° with respect to 
the tensile axis, is in agreement with the experimental findings [11,12], theoretical 
descriptions [5] and the results of calculations of Cleveringa et. al [15]. 
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Abstract. As it is well known forest fires can present serious risk to people and 
can have enormous environmental impact. Therefore researchers and land man- 
agers are increasingly interested in effective tools for use in scientific analyses, 
management and fighting operations. On the other hand forest fires are com- 
plex phenomena that need an interdisciplinary approach. In this paper the para- 
digm of Cellular Automata was applied and a model was projected to simulate 
the evolution of forest fires. The adopted method involves the definition of lo- 
cal rules, mainly based on fire spread relationships originally developed by 
Rothermei in 1972, from which the global behaviour of the system can emerge. 
The preliminary results show that the model could be applied for forest fire 
prevention, the production of risk scenarios and the evaluation of the forest fire 
environmental impact. 



1 Introduction 

Cellular Automata (CA) are good candidates for modelling and simulating complex 
dynamical systems whose evolution depends exclusively on the local interactions of 
their constituent parts [6]. A CA involves a regular division of the space in cells, each 
one characterised by a state that represents the actual conditions of the cell. The state 
changes according to a transition function that depends on the states of neighbouring 
cells and of the cell itself; the transition function is identical for all the cells. At time 
t=0, cells are in states, describing initial conditions, and the CA evolves changing the 
state of all the cells simultaneously at discrete times, according to the transition func- 
tion. 

The CA features (i.e. locality property) seem to match the “forest fire” system; the 
parameters, describing globally a forest fire, i.e. propagation rate, flame length and 
direction, fireline intensity, fire duration time etc. are mainly depending on some 
local characteristics i.e. vegetation type (live and dead fuel), relative humidity, fuel 
moisture, heat, territory morphology (altitude, slope), wind velocity and direction, 
etc. 

The fundamental studies of Rothermei [12], [ 13] point out these characteristics; his 
forest fire spread model is based on dynamic equations, that allow the forecast of the 
rate of spread and the reaction intensity knowing certain properties of the fuel matrix 
(fuel particle, fuel arrangement) and environmental conditions (slope, wind, fuel 
moisture) in which the fire occurs. 
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Some models were previously developed in terms of CA: Green [9] considered a 
two-dimensional square CA, where each cell is identified by fuel content and its 
combustion duration. The fire-spread mechanism is briefly the following: each burn- 
ing cell generates an ellipse with a focus at the cell centre; the ellipse dimension and 
orientation depends on the wind direction and strength. If a cell with fuel content is 
inside an ellipse, it will burn in the next steps. Gongalves and Diogo [8] developed a 
more complex probabilistic CA model based on the Rothermel’s equations of fire rate 
spread. A burning cell determines, according to the fire rate spread toward the direc- 
tion of neighbouring cells, an ignition probability. Karafyllidis and Thanailakis [10] 
developed a two-dimensional square CA which was only tested with hypothetical 
forests. Malamud and Turcotte [11] developed a statistical model of comprehensive 
effects in a forest for the two opposite phenomena of trees taking root in free areas 
and trees burnt by the fire. 

The proposed CA model is deterministic, it is related to Rothermel’s studies [12] 
and is partially based on the method described in [6] that may be applied to some 
macroscopic phenomena in order to produce a proper CA model. 

In the next section this paper illustrates the basic equations; the CA model formal- 
ization is described in the successive section; the fourth section treats the model im- 
plementations and preliminary results of applications; at the end comments conclude 
the paper. 



2 Fire Spread Prediction 

The majority of fire models in use today are mainly based on fire propagation rela- 
tionships experimentally developed in the U.S. by Rothermel [12], [13], Albini [1], 
Anderson [2], Fire geometry models were proposed by Anderson [3] and many other 
researchers. 

Rothermel’ s equations require a description of fuel which includes depth, loading, 
percentage of dead fuel, moisture of extinction, heat content, surface area to volume 
ratio, mineral content, silica content, and particle density. Required environmental 
variables include wind speed at half-flame height, slope and fuel moisture content 
(live and dead). Models based on Rothermel’ s equations usually perform adequately 
in predicting the detail of fire physics. 

Rothermel’s model leads to the fire maximum spread rate vector. To obtain a bi- 
dimensional description of fire behaviour it is generally assumed that the fire spread 
area is an ellipse [3] in which the source lies at one focus and the major axis is paral- 
lel to the maximum spread rate direction (which is influenced by local wind and 
slope). 

2.1 Basic Equations 

It is known that fires show a higher propagation velocity when they climb up an up- 
ward slope, whereas fires show a smaller velocity when they descend a downward 
slope. Also, wind speed and direction greatly affect forest fire propagation. Therefore, 
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according to the Rothermel’s fire study [12], maximum spread rate occurs in the 
direction of the resultant wind-slope vector: 

4> = <t + ^ (1) 



where (j) is in direction of maximum slope and its module is the slope effect, (j) H is in 

wind direction and its module is the wind effect. Slope and wind effect are respec- 
tively expressed as: 



<p s = 5215/3 0 3 tan of , and 
</> w = C (3.281 v) B (/5l/3 op y E 



( 2 ) 



where v is the wind speed, ft is the packing ratio of the fuel bed, p„ p is the optimum 
packing ratio, co is the slope (radians), C, B and E coefficients are functions of the 
fuel particle size in the fuel bed [1], [2], [12], [13]. 

The maximum fire spread rate (m min' 1 ), which accounts for local fuel characteris- 
tics, slope and wind effect, is computed as: 

^x=^o(l+ l<M). R 0 = I R ZKp b hQ ig ) (3) 



where R 0 is the spread rate on flat terrain and without wind; I R is the reaction inten- 
sity; / is the propagation flux ratio; Q ig is the heat of pre-ignition; p h is the ovendry 

bulk density; h is the effective heating number. 

The spread rate in an arbitrary direction is obtained assuming an elliptical shaped 
local spread (see fig. la): 



R = R 



max 



l-e 

1 — s cos 6 



(4) 



where the eccentricity £ of the ellipse increases as a function of the effective mid- 
flame windspeed v e according to the Anderson's empirical relation [3] as modified by 
Finney [7]: 




L =0.936 e a2566ve +0.461 e " 0 1548ve -0.397 



(5) 



The effective midflame windspeed v e , defined as the virtual windspeed that by it- 
self would produce the combined effect of slope and wind on fire spread rate, can be 
derived from eq. (3). 

It is worth noting that when v e = 0 (i.e. no slope and no wind) the eccentricity is 
zero and the ellipse reduce to a circle of radius Ro . 
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Fig. 1 . a) Elliptical fire spread area in which the source lies at one focus and the major axis is 
parallel to the maximum spread rate direction; b) Neighbourhood pattern and fire propagation 
in the proposed model: the cells inside the ellipses are burning and each ellipse represents a 
successive CA step 



3 The Cellular Automaton 

In order to avoid the spurious symmetries of the square cells, which characterize 
some previous CA based fire models [8], [9], [10], [11], the proposed model is based 
on a hexagonal lattice. The neighbourhood pattern includes the double order of hexa- 
gons around the cell, as shown in fig. lb. The further main points of the proposed 
model are described by the following: 

1 . The state of the cell must account for all the characteristics, relevant to the evolu- 
tion of the system and relative to the space portion corresponding to the cell; e.g. 
the altitude. Each characteristic must be individuated as a substate. The substate 
value is considered constant in the cell. 

2. As the state of the cell can be decomposed in substates, the transition function may 
be also split in many components. We distinguish two types of components: inter- 
nal transformation, depending on the substates of the cell and local interactions, 
depending on the substates of the cells in the neighbouring (e.g. fire propagation). 

3. Special functions must supply the history of “external influences” overall on the 
CA cells and direction. 

4. Some cells represent a kind of input from the “external world” to the CA; it ac- 
counts for describing an external initial influence, which cannot be described in 
terms of rules; e.g. the fire starting points. 

3.1 The Model Formalization 

We now give a formal description of the model which is a two-dimensional CA with 
hexagonal cells defined as: 

CA = <K, X, S, P, o, I, r> 

where: 

• K is the set of points with integer co-ordinates in the finite region, where the phe- 
nomenon evolves. Each point identifies a hexagonal cell. 
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• The set X identifies the geometrical pattern of the cells, which influences the 
change in the state of each one (the full second order neighbourhood, see fig. lb). 

• The finite set S of the states of the cell is: 

S = Sa x Sy x S T x Sh x Sc x Sd x Swd X Swr X (Sps) 18 

- S A , substate “altitude”, takes the altitude value of the cell; 

- S v , substate “vegetation”, specifies the type of vegetation, relative to the proper- 
ties of catching fire and burning. Fuel bed characteristics are specified according 
to the 13 format of fire behaviour fuel models used in BEHAVE [5]. 

- S T , substate “temperature”, takes the temperature value of the cell; 

- S H , substate “humidity”, takes the relative humidity value of the cell; 

- S c , substate “combustion” for the two possible fire types in the cell: “surface 
fire” and “crown fire”; it takes one of the values “non-flammable”, “inflamma- 
ble”, “burning” and ’’burnt” for each type. 

- S D , substate “duration", takes the value of the duration rest of the fire in the cell; 

- S WD , substate “wind direction”, takes the values of the wind directions (the eight 
directions of the wind rose) at ground level (that could be different from the free 
wind direction); 

- S WR , substate “wind rate”, takes the values of the wind rate (Km 0-60) at ground 
level (that could be different from the free wind rate); 

- S FS , substate “fire spread”, accounts for the fire spread from the central cell to 
the other neighbouring cells; 

- S FA , substate “fire acquire”, is just a renaming of S FS and individuates the fire 
propagation to the central cell from the other neighbouring cells. 

• P= [ p e , p, p t , p wd , p wr P w , P v } is the finite set of global parameters, which affect the 
transition function; they are constant overall in the cellular space: 

- p e , is the apothem of the cell. 

- p s , the time corresponding to a single step. 

- p t , the current time (month, day, hour and minute), 

- p wd , the free wind direction 

- p wr , the free wind rate; 

- P w , the set of weather condition parameters, varying during the day according to 
the season (currently P w contains the reference values of humidity and tempera- 
ture). Such parameters are correct considering the type of vegetation, the wind 
and in the case of a previous rain episode as far as a month 

- P v , the set of parameters concerning catching fire and burning, depending on the 
type of vegetation; they are also constant in time. 

Note that the function yi keeps p t up-to-date, the function y 2 supplies the history of P w , 

p w d and p m - at each CA step. 

• a: S 19 — >S is the transition function, that will be sketched in the next section; it 
accounts for the following aspects of the phenomenon: effects of combustion in 
surface and crown fire inside the cell, crown fire triggering off; surface and crown 
fire spread, determination of the local wind rate and direction. 

• IcK individuates the cells where the fire starts. 
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• r = { Yj, y 2 , y 3 , Y 4 } is the set of functions representing an external influence. They 
are computed at each step before the application of a. 

• Yi determines the current time of the CA step; 

• Y> supplies the weather conditions related to sun, the external wind direction and 
rate, for each CA step; 

• Yr Nxl— >S C accounts for external setting fire to cells of I at prefixed steps, where 
N is the set of natural numbers; 

• Yt : NxK— >Sc accounts for firemen intervention at prefixed steps. 



3.2 The Transition Function 

The transition function is defined as: 

0 = 0 D® 0 I2® 0 I1®°T1 

First the following internal transformation, concerning the effects of combustion in 
surface and crown fire inside the cell, is computed; 

Oxi • S v xS T xS| [XS ( xS[)XS\v[)XS\vr — ^ S^xSi [XS( XSd. 

Then the following local interactions are applied: 

- o„:(S A )%S WD xS WR 

- g I2 :(S fa ) 18 xS c -aS c 

- <V (S A ) 19 x S v x S c x S H x S T x S D x S WD x S WR -a (S fs ) 18 

where o n computes the wind direction and rate at the cell altitude, o I2 computes the 
change of combustion conditions in the current cell and o I3 computes the fire spread 
toward the neighbouring cells. 

Internal transformation T1 

When the substate S c is not “burning” it doesn’t change the substate S D , while the 
substates S H and S T vary their previous values on the basis of the weather change (P w ), 
the day hour (p , ), the wind (S WD and S WR ) and the vegetation type (S v and P v ). When 
the substate S c is “burning”, then the substates S T , S H , S c and S D depend on the previ- 
ous values of S H , S D and S T . The value 0 for S D determinates the change of S c to 
“burnt”. The conditions for the ignition from fire surface to crown fire are applied in 
the internal transformation T]. 

Local interaction II 

The computation of the substates S WD and S WR , “wind direction and rate” depends on 
p wd and p Hr that represent the values of the free wind. Such values are reduced in rela- 
tion to the altitude of the cell according to an empirical table of Rothermel [13], ob- 
taining the wind vector, related to the cell altitude. A corrective vector wind is com- 
puted, considering the angle between the slope relative to the cell and the free wind 
direction. It accounts for the absence of free wind, convective currents form along the 
maximum slope directions; their rate depends also on the weather and the hour of the 
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day. Some empirical tables of Rothermel [13] indicate the value of such vector. The 
new values of S W d and S W r are obtained adding the altitude vector to the corrective 
vector. 

Local interaction 12 

This function tests if the fire is spreading toward the central cell from the other cells of 
the neighbouring. If the combustion substate Sc is “inflammable”, then it changes to 
“burning”. 

Local interaction 13 

If the state S c is “burning” the following computation steps, depending on substates 
S v , S H , S T , S D- S WD , S WR and the set of parameters P v , are considered: 

a) First, using equations (1) and (2), the maximum spread vector is determined as the 
sum of slope effect (j) and wind effect (|) ; 

b) Using equation (3) the maximum spread rate R max is computed. 

c) An ellipse with a focus in the centre of the cell whose area depending on R max , is 
calculated using equation (5); the time corresponding to the CA step p s also affects 
the area of the ellipse (see fig. 1); 

d) The fire can propagate towards the neighbouring cells inside the ellipse, i.e. S F s 
(which is an alias of S FA ) takes the value “true”. 

4 Implementation and Preliminary Applications 

The implementation was developed in the programming language C++ using the 
OpenGL library for visualization. The main input are the DEM (Digital Elevation, 
Model) of the area, the gridded land cover (e.g. fuel kind) and the weather conditions. 

A first validation was tested on a real case of forest fire (fig. 2) in the territory of 
Villaputzu, Sardinia, on August 22” , 1998. This forest fire was a case treated and 
monitored in the milieu of European Research Program SALTUS, Environment and 
Climate [15]. The fire began at 12.30 and lasted less than 9 hours. The wind was in 
direction SE at a rate 20-30 Km/h. The burnt area was approximately 3 Km". The 
forest fire area in fig. 2 is illustrated by a legend specifying the different type of burnt 
vegetation. The dotted lines indicate the roads. The roads on the left and on the right 
of the initial area of the fire represent a barrier, because the human intervention to 
limit the fire spread is easily performed in the areas adjacent to roads. 

The simulated area (fig. 3 A, B, C, D) is visualised at each simulation step by a 
matrix with 170 rows and 130 columns; the hexagonal cell apothem is about m 20; 
the parameter p s is 2.5 min (as a result of a performed calibration procedure). The 
forest fire started in the area on the top left of fig. 2. In the simulation matrix, a bar- 
rier is introduced in the flat area to the right of the fire beginning area, as an easy 
human intervention was performed there. Step 30 of the simulation (fig. 3-A, 75 min- 
utes) represents the first cells burning. In the step 90 of the simulation (fig. 3-B, 3.75 
hours), the fire is rapidly reaching the mountain peaks. The fire is propagating ac- 
cording to the wind direction and the positive slope also allows the fire to expand 
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towards E, NE. Then the propagation towards the wind direction is almost exhausted 
and its rate slows with the negative slope (fig. 3-C, almost 7 hours). The only propa- 
gation direction is now SW and the fire proceeds very slowly (fig. 3-D, 8.3 hours). 

Fig. 4 represents the superimposition of the real fire of simulated fire. The result of 
simulation looks good, few areas of the real fire don’t appear to be burnt in the simu- 
lation and few areas, which were not burnt, are computed to be “burnt” in the simula- 
tion. 





A prevalent vegetation (maquis 
^ with dstus, phillyrea and lentiscus) 

D forest of conifers with q uerc us suber, 
p. lentiscus, phillyrea 
C vegetation of cistus 

D maquis with juniperus ph. 



Fig. 2. The forest fire of Villaputzu, August 22 nd , 1998 




Fig. 3. Sequence of four simulation steps for the test case in fig. 2: figures A, B, C, D corre- 
spond respectively to steps 30, 90, 170,200 
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Fig. 4. Simulated vs. real fire spread for the test case in fig. 2 




Fig. 5. CA model outcomes in a complex morphology 



Many other simulations have been performed (see fig. 5 for example) and cur- 
rently we are comparing the outcomes with those of a non CA based wildfire simula- 
tor (i.e. FARSITE simulator whose spread prediction is based on Huygen’s principle 

[7]). 

5 Conclusions 

The results of the simulations show a phenomenon development that fits significantly 
with real data. Because of the use of hexagonal cells, the proposed model does not 
present the spurious symmetries of the square lattice, which characterizes some pre- 
vious CA based fire model and which can affect negatively the simulation, especially 
in flat areas. Nevertheless it is necessary to stress that validation of fire spread models 
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is difficult because usually standard fire records don’t account for all the data neces- 
sary for simulation (e.g both the collected vegetation and weather data are often 
incomplete). Future work needs to concentrate on both including a more sophisticated 
fire description (e.g. spotting phenomena) and validating the model using observed 
fire growth patterns in a number of fuels, morphology and weather conditions. 
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Abstract. The modelling of wildland fire spread across a heteroge- 
neous landscape is significant because fire dynamics are sensitive to 
local spatial characteristics. The development of accurate fire models 
and simulations is important due to the economical and social losses 
wildland fire can cause and the resulting need to better understand, 
predict, and contain fire spread. We present a methodology for encoding 
the spread of wildland fire in a set of interacting automata. The Circal 
formalism is used to explicitly describe the transmission of fire as an 
interaction between discrete cells of landscape. We demonstrate the 
potential for the methodology to accurately model spatial dynamics by 
giving results of our implementation of a fire spread model that includes 
a heterogenous environment. 

Keywords: Modelling wildfire spread, cellular automata, Circal. 



1 Modelling Fire Spread 

Fire spread is a phenomenon that deserves attention from scientists; not just 
because it is socially and economically important, but also because the phe- 
nomenon is complex, difficult to model, and computationally expensive to sim- 
ulate. Indeed, there does not exist a verifiable method for better-than-real-time 
simulation of the phenomenon. The heterogeneity of a landscape is a specific 
problem of fire spread simulation. The landscape comprises the heterogeneous 
variables fuel (including fuel load, fuel type, moisture), slope of terrain, and wind 
direction and strength. In Section 2, we describe our method for capturing the 
heterogeneities of the landscape in the state of finite automata and for building 
a structure of connected automata as depicted in Fig. 1. We then describe a 
method for describing the interaction of these automata that is also heteroge- 
nous; the next-state transitions depend on the features of the landscape. 

Throughout the world, and specifically in Australia where the climate is 
dry and hot, wildland fire is a significant problem because it causes loss of life 
and property. The scientific challenge is to produce accurate, spatially dependent 
models and tractable, powerful simulations that can be used to investigate ‘what 
if’ questions, containment strategies, and to develop training tools for firefighters. 



P.M.A. Sloot, B. Chopard, and A.G. Hoekstra (Eds.): ACRI 2004, LNCS 3305, pp. 395—404, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 
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Rothermel’s research on understanding the physics of fires [1], forms the 
basis for many approaches to modelling the behaviour of fires in wildland envi- 
ronments. Some approaches to modelling fire spread are discussed in Section 1.1. 



1.1 Related Research 

Previous approaches to modelling the fire spread phenomenon can be categorised 
as either empirically-based, physically-based, or a combination of both of these. 
The techniques used to implement these models include cellular automata tech- 
niques (such as Cell-DEVS and other formalisms that involve discretisation) [2, 
3,4,5], numerical implementations of differential equations [6, 7, 8, 9], fractal ge- 
ometry [5], and artificial neural networks [10]. In this section, we highlight the 
problems faced by researchers when trying to deal with the heterogeneous nature 
of real landscapes in fire spread simulations using cellular automata techniques. 

Bossert et al. [11] validate their simulation against a real scenario where the 
turbulence of wind (the turbulence is affected by wind speed and the fire itself) 
caused a marked increase in the spread of the fire due to convection. Research 
by Linn et al. [8,9] clearly demonstrates the notion that a heterogeneous envi- 
ronment produces phenomena that cannot easily be predicted by extrapolating 
fine-scale experimental data. 



current state 
active connection 
responsive port 




Fig. 1 . This figure shows the structure of a simple homogeneous landscape of 25 cells. 
The state of the centre- left cell (burning) causes a synchronisation of actions /events 
between the automaton of the centre-left cell and the automaton of the cell below it. 
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Clarke et al. [5] uses a cellular automaton to implement a model based on 
fractal geometry. The research is significant because historic data is used to 
calibrate the model. The argument is presented in support of the fractal nature 
of wildfire spread; the spread of wildfire is highly sensitive to the heterogeneities 
of the landscape and initial conditions, and fire is self-replicating. One of the 
distinct differences between the implementation by Clarke et al. and ours, is 
that we use heterogeneous wind and fuel and we do not use weighted, random 
processes to simulate the influence of heterogeneity of the landscape. 

Muzy et al. have developed several implementations of a semi-physical model 
of fire spread [2] based on Rothermel’s model [1]. They compare the differ- 
ent approaches to implementing the semi-physical model (namely DEVS and 
Cell-DEVS approaches [12,13]) and conclude that a more rigorous method of 
quantisation is required for more accurate and computationally less expensive 
simulations [4]. Muzy and Wainer [3] have developed such an approach using 
Cell-DEVS quantisation techniques and they conclude that they have reduced 
the time taken to run a simulation and increased the accuracy of the results. 
The discrete, event-based implementation of a model based on Rothermel’s orig- 
inal work has accounted for the time and space sensitivities of the fire spread 
simulation problem. 

Our approach to modelling the fire spread phenomenon is different to the 
approaches described in this section because we explicitly describe the interac- 
tions between each discrete cell in the simulation landscape. We discretise the 
landscape in a way that is typical to cellular automata but then we encode the 
spatial information of that cell as input of the state of an automaton and use the 
interactions between the automata of different cells to determine the behaviour 
in a process algebra way. 

In their Cell-DEVS approach, Wainer and Muzy [3] discretise Rothermel’s 
equation for fire spread by the state of the fire (heating, burning, and burnt), but 
they use continuous arithmetic to describe the heat exchange between cells. We 
avoid the use of continuous arithmetic for heat exchange by defining a discrete 
propagation delay (detailed in Sect. 2) that is discrete in both space and time, 
and relies on a discrete addition operator instead of a continuous multiplica- 
tive one. In Sect. 3 we show that we can achieve good results in the simplified 
homogeneous case using this technique. 

2 Modelling Spatial Dynamics via Interacting Automata 

Our approach for modelling the fire spread phenomenon is to use interacting au- 
tomata. This approach requires the discretisation of the landscape into (usually) 
equally sized cells. The discretisation of the spatial information that determines 
the spread on a cell-by-cell basis is encoded within each cell’s automaton in terms 
of state and a next-state transition function. 

The automaton that corresponds to a specific cell is a finite automaton (de- 
terministic or probabilistic) with a set of states S, alphabet V, a next-state 
transition function S, and an initial state i. In this section we present an argu- 
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ment for this approach and show how to encode the fire spread model in a set 
of interacting automata. 

How is the information encoded in the automaton? The state of the 
automaton captures the spatial features of the particular cell to which it belongs. 
The state (s £ S) of the automaton is defined by the set comprising slope 
gradient, slope orientation, wind, and fuel as follows: 

state s : S = F x T x W, where 
T = St x O, 

St = {flat, slight, mid, steep}, 

O = {n, s, e, w}, 

W = {/, n, nw, w, sw, s, se, e, ne}, and 
F = {unburnt, burning, burnt}. 

The above sets refer to the gradient (slope St), the aspect (orientation O) of 
the terrain T, the wind direction W, and the state of the fuel F at the location 
of each cell. Figure 2 shows the deterministic finite automaton in which this 
information is encoded for the example where the slope is mid-ranged (mid), 
the orientation of the slope is northerly (n), the wind is easterly (e), and the 
fuel is not yet burnt (unburnt). 

The state of an automaton captures the spatial information of the cell to 
which it belongs, and this encoding is therefore a heterogeneous description of 
the landscape. Figure 2 depicts the state and labelled transitions of the finite 
automaton where the cell slopes upwards towards the north and has an east- 
erly wind direction. We have used a similar discretisation of fuel as Wainer and 
Muzy [3], but we also use discrete values for the description of wind and terrain. 
The result of this approach is a set finite automata that are encoded with in- 
formation describing the landscape as a combination of terrain, wind, fuel, and 
fire. 



2.1 The Significance of the Circal Formalism 

To allow the automata to interact, we need to capture explicit, non-lromogeneous 
interaction between automata. For this, we use Circal [14,15,16], a process al- 
gebra that has been used for describing and verifying complex systems such as 
digital hardware including asynchronous logic and communication protocols. In 
this section, we detail the use of the Circal formalism as a specification language 
for encoding the spatial dynamics of fire spread as a set of connected and in- 
teracting automata, after the landscape has been discretised using the cellular 
automata paradigm. 

Circal is an appropriate formalism for this approach because it has the neces- 
sary constituents to permit these modelling concepts to be well-captured [17,18]. 
Circal is a rigorous formalism and allows concurrent communication between an 
arbitrary number of processes. We use Circal to explicitly describe the interac- 
tions between automata and encode the spatial information of each cell in the 
states of each automaton. 




Modelling Wildfire Dynamics via Interacting Automata 399 



How do the automata interact? We impose a structure on the landscape as 
we discretise it; defining areas where fire communication may occur. The adjacent 
neighbours of a cell can see an abstracted view of the automata because of the 
way the structure is built. Below, we describe the terminology of the structure, 
and the procedure for determining the next state transition function. 

The communication of fire spread between cells is captured by the actions of 
each automaton in conjunction with automata that it connects itself with. The 
actions (or transitions) of each automaton are the alphabet set E. We say that 
two automata are connected if their associated cells are neighbours as depicted 
in Fig. 1. In the case of the simulations described in this paper, neighbours will 
always include the area of cells within a radius (in the cellular automata sense of 
the word) of two from the cell in question, but two automata could be connected 
arbitrarily within the landscape using the methodology. This is equivalent to 
the neighbourhood term used in cellular automata based simulations. Figure 1 
shows a landscape of size five by five and depicts all the cells that make up 
the neighbourhood of the centre cell. The centre-left cell and its connections are 
highlighted in the figure. The specific terms used in the figure are defined in the 
following paragraphs. 




Fig. 2. This figure depicts the state and transition actions of a deterministic finite 
state automaton that encodes the slope, terrain, and fuel information of a cell in a 
landscape. This cell slopes upwards towards the north, has a midranged gradient, and 
has an easterly wind direction. 
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In the Circal formalism, the composition operator [18] uses similarly labelled 
actions to ‘wire together’ structures such as the cells in Figure 1. The dashed 
lines in Figure 1 are connections between connected automata are built using the 
composition operator. This is called the synchronisation- of- actions technique. 

Cerone and Milne [19] describe a process for the hierarchical description of 
interacting automata in their research into asynchronous micropipeline digital 
electronics. The connections between cells are the medium for the spread of fire 
from one cell to another and are built between similarly labelled ports. The ports 
are associated with the actions in the automaton that are not abstracted as a 
part of the procedure described below: 

1. All actions are relabelled to have the same name as the port they are con- 
nected to or to an anonymous name if they are not connected to a port. 

2. The separate cells are composed ; similarly named ports are ‘wired up’ with 
connections. 

3. The actions inside each cell that are not connected to a port cannot be seen 
from outside the box and are abstracted. 

Figure 1 shows an example of the structure that has been connected using the 
process described above. In this case, each of the actions in an automaton’s 
alphabet E have been relabelled to reflect the coordinates of the associated cell in 
the landscape. The automata are composed to build connections using the Circal 
formalism’s composition operator and the alphabet is reduced (abstracted) to 
remove the actions that are not visible from outside the cell. 

The synchronisation-of-actions corresponds to the next state transitions func- 
tion 5 in the automata with the actions taken from the alphabet E. In this case, 
the transition function S becomes a function that determines the next state of 
each automata based on a current set of enabled actions and hence indirectly 
on the state of the automata in the connected cells. An action is enabled if it 
can synchronise with the other similarly labelled actions through a responsive 
port. A port becomes responsive when all the associated actions emanate from 
the current states of the automata that they are associated with. 



Why is it good to do it this way? We have developed a method for describing 
heterogeneous landscapes using finite automata and using a rigorous formalism 
to describe the interactions between the automata. The methodology couches 
the structure and operation of simulations in the paradigm of cellular automata. 

Unlike the cellular automata approach, we define heterogeneous landscapes 
by encoding the spatial information as the state of each automaton. Like the 
classical cellular automata approach, the state information of the neighbours of 
a cell determines the next state of the cell. Although we use cellular automata 
principles to discretise the time and space of the model, the interactions between 
the cells are defined explicitly as a set of actions using the synchronisation-of- 
actions technique rather than as a homogeneous update. 
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3 Experiments and Results 

We have identified heterogeneity of landscape as an important feature of mod- 
elling the fire spread phenomenon. In this section, we describe the experiments 
we have carried out using an implementation of the fire spread model. The main 
aim of these experiments is do demonstrate the effect of the abstractions we 
have made on the shape of the fire spread and to show the effect of simulating 
the heterogeneous landscape explicitly using interacting automata. All of the 
experiments conducted use a landscape with homogeneous fuel. 




Fig. 3. This figure shows the effect of 
terrain on fire spread. The contours indi- 
cate the position of the fire front at regu- 
lar intervals. The topography is indicated 
by darker colours at lower elevations. 




Fig. 4. This figure views a section of the 
simulation results presented in Figure 3 
from a different position, highlighting the 
effect of terrain on the rate of spread 
of the fire. The fire front spreads more 
quickly on the uphill slope at the middle- 
top of the figure and less quickly on a the 
downhill slope on the middle-left of the 
figure. 



Hargrove et al. describes how the shape of a fire front approximates an ellipse 
after it has burned for a period of time [20]. In the case of a homogeneous 
environment (homogeneous wind, flat terrain, and no wind), the ellipse degrades 
to a circle. Figure 3 demonstrates the difference between a near-flat terrain and 
a hilly terrain in a scenario that uses a homogeneous fuel landscape and no wind. 
More detail is shown in Fig. 4, which is a different view of a section of the results 
presented in Fig. 3. The results show the circular contours on flat terrain (the 
right hand side of Fig. 3) and the effect of terrain on the rate of spread of the 
fire. 

Balbi et al. and Santoni et al. describe the effect of slope on wildfire 
spread [21,22] and the shape can be approximated by a double ellipse. From 
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visual inspection of the results, we can conclude that the results of our experi- 
ment give a reasonable approximation to both the circular shape we expect in 
the case where the terrain is relatively flat and the elliptical shape we expect 
on a hilly terrain. Figure 4 gives a clearer view of the effect of slope on wildfire 
spread in our experiment that corresponds to the shape described by Balbi et 
al. and Santoni et al. 

When wind is introduced into the scenario described above, the results show 
that the effects of both wind and terrain on the shape of a spreading fire front are 
captured by the implementation that uses the interacting automata approach. 
The shape produced in Fig. 5 is a good approximation of the ellipse expected 
for a northerly wind direction on a relatively flat terrain. The effect of both 
wind and terrain can be seen in Fig. 6 where the effects of wind and terrain are 
captured together, giving a much more irregular spread pattern, as would be 
expected from knowledge of actual fire spread. 




Fig. 5. This figure shows the effect of 
wind on a relatively flat terrain. An as- 
terisk marks the ignition point, and the 
wind is homogeneous and its direction 
is from the top to the bottom of the 
figure. The contours represent the front 
edge of the spreading fire at regular inter- 
vals. The topography is indicated using 
a darker colour for lower elevations. 




Fig. 6. A view of two types of terrain 
is presented in this figure. The top-left 
half of this figure represents a hilly ter- 
rain and the bottom-right of this figure 
represents a flat terrain. The wind is ap- 
proaching from the top-right corner of 
the figure and the topography is indi- 
cated using a darker colour for lower el- 
evations. 



4 Discussion 

In this paper, we have discussed the significant problem of wildfire spread mod- 
elling and introduced a methodology that can be used to capture spatial hetero- 
geneity via state encoding and explicit actions given by current state. A method 
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for capturing spatial heterogeneity within automaton state has been presented 
and utilised in a simulation environment. The suitability of the interacting au- 
tomata approach to wildfire spread modelling has been demonstrated using the 
implemented simulator. 

The results demonstrate the ability of the methodology to capture spatial 
heterogeneity of a landscape and simulate the spread of wildfire on a landscape 
that is regular and non-uniform. We have not validated the model against factors 
such as fuel loads, topography, and varying wind strength. The experiments de- 
tailed in the previous section are centred on the demonstration of the interacting 
automata approach and have shown that it can be used for heterogeneous en- 
vironments, reproducing the elliptical shape expected of a simulation of wildfire 
spread. The approach effectively captures the information about the landscape 
as the state of an automaton, and uses explicit communication between cells to 
describe the change in state over the landscape as the fire spreads. 
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Abstract. A probabilistic cellular automaton is developed to study the 
combined effect of competition and assortativity on the speciation pro- 
cess in the absence of geographical barriers. The model is studied in the 
case of long-range coupling. A simulated annealing technique was used in 
order to find the stationary distribution in reasonably short simulation 
times. Two components of fitness are considered: a static one that de- 
scribes adaptation to environmental factors not related to the population 
itself, and a dynamic one that accounts for interactions between organ- 
isms such as competition. The simulations show that both in the case of 
flat and steep static fitness landscape, competition and assortativity do 
exert a synergistic effect on speciation. We also show that competition 
acts as a stabilizing force preventing the random sampling effects to drive 
one of the newborn populations to extinction. Finally, the variance of the 
frequency distribution is plotted as a function of competition and assor- 
tativity, obtaining a surface that shows a sharp transition from a very 
low (single species state) to a very high (multiple species state) level, 
therefore featuring as a phase transition diagram. Examination of the 
contour plots of the phase diagram graphycally highlights the synergetic 
effect. 



1 The Problem of Sympatric Speciation 

The notion of speciation in biology refers to the splitting of an original species 
into two fertile, yet reproductively isolated strains. The allopatric theory , which 
is currently accepted by the majority of biologists, claims that a geographic 
barrier is needed in order to break the gene flow so as to allow two strains to 
evolve a complete reproductive isolation. On the other hand, many evidences 
and experimental data have been reported in recent years strongly suggesting 
the possibility of a sympatric mechanism of speciation. For example, the compar- 
ison of mythocondrial DNA sequences of cytochrome b performed by Schlieven 
and others [1], showed the monoplryletic origin of cichlid species living in some 
vulcanic lakes of western Africa. The main features of these lakes are the envi- 
ronmental homogeneity and the absence of microgeographical barriers. It is thus 
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possible that the present diversity is the result of several events of sympatric 
speciation. 

The key element for sympatric speciation is assortative mating that is, mat- 
ing must be allowed only between individuals whose phenotypic distance does 
not exceed a given threshold. In fact, consider a population characterized by a 
bimodal distribution for an echological character determining adaptation to the 
environment: in a regime of random mating the crossings between individuals 
of the two humps will produce intermediate phenotypes so that the distribution 
will never split. Two interesting theories have been developed to explain the 
evolution of assortativity. In Kondraslrov and Kondrashov’s theory [2] disruptive 
selection (for instance determined by a bimodal resource distribution) splits the 
population in two distinct echological types that are later stabilized by the evo- 
lution of assortative mating. The theory of evolutionary branching developed by 
Doebeli and Dieckmann [3] is more general in that it does not require disruptive 
selection: the population first converges in phenotype space to an evolutionar- 
ily attracting fitness minimum (as a result of common echological interactions 
such as competition, predation and mutualism) and then it splits into diverging 
phenotypic clusters. For example [4], given a gaussian resource distribution, the 
population first crowds on the phenotype with the highest fitness, and then, 
owing to the high level of competition, splits into two distinct groups that later 
become reproductively isolated due to selection of assortative mating. 

In the present paper we will not investigate the evolution of assortativity 
that will be treated as a tunable parameter in order to study its interplay with 
competition. In particular we will show that: (1) assortativity alone is sufficient 
to induce speciation but one of the new species soon disappears due to random 
fluctuations; (2) stable species coexistence can be attained through the intro- 
duction of competition; (3) competition and assortativity do exert a synergistic 
effect on speciation so that high levels of assortativity can trigger speciation 
even in the presence of weak competition and vice versa; (4) speciation can be 
thought of as a phase transition as can be deduced from the plot of variance 
versus competition and assortativity. The use of a simidated annealing method 
enables us to find stationary or quasi-stationary distribution in reasonably short 
simulation times. 



2 The Model 

We consider a constant population of N haploid individuals modeled as binary 
strings. Each bit represents a locus and the Boolean values it can take on repre- 
sent alternative allelic forms, namely, g- t = 0 represents the wild-type allele and 
gi = 1 is the least deleterious mutant. In agreement with the theory of quanti- 
tative characters [5] we assume that the effects of genes are small, similar and 
additive, so that the phenotype can be expressed by the sum of the bits of the 
genotype. 

The environment is modeled as a probabilistic cellular automaton with a large 
number of states, one for each different genotype. All the cells of the lattice are 
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occupied by organisms and the population size remains constant throughout the 
simulation. The evolution of the automaton is given by the application of three 
rules: recombination, reproduction and mutation. 

The recombination rule is implemented as follows, adopting the point of view 
of an individual of phenotype x located in a cell of a lattice. 

Recombination. Each invividual looks for a potential partner within its mat- 
ing range (dependent on the phenotypic distance) in a given neighborhood 
of radius p. For each partner found, uniform recombination is applied so as 
to produce an offspring with phenotype 2 . A(z) copies of the recombinant 
offspring are then stored in a temporary offspring. 

Reproduction. As the population size must remain constant, an individual is 
randomly chosen from each offspring array which then replaces the parent. 
Mutation. For each individual of the new generation, flip a randomly chosen 
bit with probability /./,. 

As the model is very complex we will study it using a mean-field approx- 
imation that enables us to disregard the spatial structure. The approximation 
becomes accurate in the limit of a very large radius of the neighborhood so that 
the neighborhood will extend on all the lattice. 

In order to perform the mean-field analysis, let us first define the mating range 
A as the maximal phenotypic distance between two indivibuals, still compatible 
with mating; we also define assortativity as A = L — A. In a more formal way, 
the mating preference function can be defined as: 



m A (y,z) 



0 if \y — z\ > A , 

1 otherwise, 



Once an organism has chosen a suitable partner we apply uniform recombi- 
nation i. e. for each locus the offspring might receive an allele from either parents 
with equal probability. Uniform recombination therefore implies absence of link- 
age which is a reasonable approximation only in the case of a very long genome 
distributed on many independent chromosomes. 

We illustrate the most important features of the model in the mean-field 
approximation. Let us introduce the overlap q between two genotypes (<7i, 52 ; • • • ) 
and (hi,h, 2 , ■ ■ ■) as the number of loci i for which = hi = 1. For a pair of 
phenotypes x < y, it can be shown [6] that the overlap must always be in the 
range max(0, x + y — L)<q<x. The probability Q X y(q) is 



Qxy(q) — 




Due to the property of binomial coefficient that (£) = 0 if b < 0 or b > a, 
Qxy(q) 7 ^ 0 only for max(a: + y — L, 0) < q < min (x,y). Notice that using this 
property we can freely sum over q obtaining Q xy (q) = 1. 
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If phenotype y belongs to the mating range of phenotype x, it can be 
proven [6] that the phenotype 21 of the offspings must lie in the range q < 
z < x + y — q. The probability R% y {z) that two phenotypes x and y with given 
overlap q originate a phenotype z is 



R q xy{z) 



fx + y- 2 q\ l *+*~ 2 ® 

V z-q ) 2 



Again, due to the properties of binomial coefficients, R% y {z) 7^ 0 for q < z < 
x + y — q- Summing over z one has Yh z R xy( z ) = 1- 

Finally, let us define A(z) as the number of offsprings with phenotype z 
generated in each suitable crossing. Using this quantity, the selection step will be 
implicitly performed within the reproduction step. A(z), in fact can be thought 
of as a quantity related to the probability that an individual with phenotype z 
may survive till the reproductive age. 

We can now compute the number of individuals with phenotype z in the 
offspring generation: 



- A ( 2 ) EE ”M!S-M E «, ( , )s , (2) , 

x y ' ' q 

where the factor 

M(x) = m A \(x, k)n(k) 



(1) 



is just the total number of individuals in the mating range of phenotype x. 

If N is the constant size of the population, we must now introduce a nor- 
malization factor C such that ^ Yh z n '( z ) = N. If we introduce the quantity 
M{ x ) = J2k mA ( x ’k)p(k)i it can be noticed that M(x) = NM( x) and the 
normalization factor can be expressed as: 



c = E- 4 «EE 

z X y 



p{x)p{y)mA{x,y) 

M(x) 



Y.Q*y^ R Uz) 



(2) 



The new frequency of phenotype z after the normalization procedure, which 
corresponds to the reproduction step of the microscopic model can be therefore 
computed as: n"(x) = 

Finally we can implement the mutation step that does not affect the total 
size of the population as 



'(z) = (! - p)ri\z)+n 



L-z + 1 
L 1 



^-1) + 



z+1 



'( z +l) 



(3) 



where L is the genome lenght and y is the per bit mutation rate. It is therefore 
L z 1 

clear that y is the probability that an individual of phenotype z — l 



L 

undergoes a mutation on a bit 0 while 



z + 1 



y is the probability that an individual 
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with phenotype z + 1 mutates a bit from 1 to 0. The structure of Equation 3 
therefore reflects the fact that in our model we implement only short range 
mutations allowing an individual with phenotype x to turn only in one of the 
neighbouring phenotypes x — 1 and x + 1. 

As the fitness is related to the survival probability, it is defined as the ex- 
ponential of the fitness landscape: A(x,t) = e H ^ x,t \ An important feature of 
our model is that we consider two components of fitness. The static component 
describes the adaptation to environmental factors not related to the population 
itself such as climate or temperature, while the dynamic component relates to 
the interactions with other members of the population and therefore it changes 
in time as a function of the phenotype frequencies. In our model we consider 
only competition. The static component of the fitness landscape is defined as a 
function of the phenotype x: 



H 0 (x) = e M?) /3 

This function becomes flatter and flatter as (3 is increased: for [3 — > 0 the 
function becomes a sharp peak at x = 0; when (3 = 1 the function is a declining 
exponential whose steepness increases with T. Finally for (3 — > oo the fitness 
landscape is constant in the range [0,T] and zero outside. The dynamic part of 
the fitness has a similar expression, with parameters a and R that control the 
steepness and range of competition among phenotypes. The complete expression 
of the fitness landscape is: 

H(x,t) = H 0 (x) - J^e~«W\ p(y,t) (4) 

v 

The parameter J controls the intensity of competition with respect to the 
arbitrary reference value Hq(0) = 1. If a = 0 an individual with phenotype x is 
in competition only with other organisms with the same phenotype; conversely in 
the case a — > oo a phenotype x is in competition with all the other phenotypes 
in the range [x — R, x + R] , and the boundaries of this competition intervals 
blurry when a is decreased. 

In the limit of an infinite range of neighborhood p — > oo, for the sake of 
computational efficiency we implemented our model as follows. The initial pop- 
ulation is chosen at random and stored in a bidimensional matrix with L + 1 
rows and N columns. Each row represents one of the possible phenotypes; as the 
whole population might crowd on a single phenotype, N memory locations must 
be allocated for each phenotype. Each generation begins with the reproduction 
step. The first parent is chosen at random; in a similar way, the second parent 
is randomly chosen within the mating range of the first one. 

The offspring is produced through uniform recombination, i.e., for each locus 
it will receive the allele of the first or second parent with equal probability; the 
recombinant then undergoes mutation on a random allele with probability p(t). 
The newborn individuals are stored in a second matrix with L + 1 rows and N 
columns. The reproduction procedure is followed by the selection step. As we 
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consider a constant size population, a cycle is iterated until N individuals are 
copied back from the second to the first matrix. In each iteration of the cycle, an 
individual is chosen at random and its relative fitness is compared to a random 
number r uniformely distributed between 0 and 1: if r < A(x)/ A the individual 
survives and is passed on to the next generation, otherwise a new attempt is 
made. 

In order to speed-up simulations in computer time, we adopted a simulated 
annealing technique: the mutation rate /r depends on time as 

MW = 2 M °° - tanh 

which roughly corresponds to keeping /i = /io (a high value, say 10/./V) up to 
a time t — 6, then decrease it linearly up to the desidered value n oo is a time 
interval 26 and continue with this value for the rest of simulation. 

In the present work we also studied the behaviour of mean fitmess H during 
evolution. According to Fisher’s fundamental theorem of natural selection [7] 
and Holland’s schemata theorem [8], in fact, the average fitness should always 
increase. However, according to the modern interpretation of Fisher’s theorem 
by Price [9] and Ewens [10] the total change in mean fitness is the sum of a 
partial change related to the variation in genotypic frequencies (operated by 
natural selection) and a partial change related to environmental deterioration, 
and Fisher’s theorem predicts that the first contribution only is always non- 
negative. The schemata theorem on the other hand, states that schemas with 
higher than average fitness tend to receive an exponentially increasing number 
of samples in successive generations. As a consequence, the average fitness of 
the population is expected to increase as time goes by. Holland’s theorem, how- 
ever, is not general in that it is based on the assumption of fixed fitness for 
each phenotype. In realistic situations, fitness is frequency dependent, so that 
as the population becomes richer and richer in fitter individuals the intensity of 
competition also increases therefore counteracting a further increase in the fre- 
quency of the best adapted phenotypes. This increase in competition intensity 
is exactly what was meant by deterioration of the environmental conditions in 
Price and Ewens’s reformulation of Fisher’s theorem [9,10]. In our formalism, 
the deterioration is explicitly introduced by the competition term. 

3 Simulations 

The simulations we performed clearly show the synergistic effect of competition 
and assortativity in inducing speciation. As an example consider the case of a 
flat static fitness landscape. In a regime of random mating ( A = 14 or, equiv- 
alently, A = 0) speciation is never possible however high the competition level 
may be chosen. Even choosing an extremely high competition intensity J = 16 
and competition range R = 7 (we chose a genome lenglrt L = 14 so that the 
phenotype can only take on values in the range [0, 14]) the frequency distribution 
only becomes trimodal without splitting because the crossings between the two 
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extreme humps always reproduce the intermediate phenotypes. This shows that 
while competition was sufficient to induce speciation in non-recombinant popu- 
lation (see Bagnoli [11]) when sexual reproduction is introduced this is no longer 
true. The missing ingredient for the speciation of recombinant populations is as- 
sortativity. If we choose a very high level of assortativity such as A = 1 (A = 13), 
speciation is indeed possible even in the absence of competition ( J = 0) but one 
of the newborn species soon disappears due to random fluctuations and the final 
distribution shows only a single delta peak. It can be therefore concluded that 
assortativity allows speciation but not the stable coexistence of the new species. 
We found that competition acts as a stabilazing force: in fact if we keep a very 
high level of assortativity A = 1 {A = 13), but we introduce a weak competition 
(J = 1, R = 2), the final distribution will be characterized by the stable coexis- 
tence of two or three species represented by delta-peaks located in the phenotypic 
space so as to minimize the inter-specific competition. The stabilizing effect of 
competition can be easily explained if we recall that competition decreases with 
the phenotypic distance between the interacting species and becomes maximal 
between organisms with the same phenotypes. Consider a frequency distribu- 
tion with two delta peaks: there will be intra-specific competition within each 
peak and inter-specific competition between the peaks. Suppose the two peaks 
at the beginning were equally populated and then due to random fluctuations 
the frequency of the first peak decreases while the frequency of the second one 
increases. In the first peak there will be a decrease in intraspecific competition 
and thus an increase in fitness leading to an increase in frequency whereas in the 
second peak the decrease in fitness due to the higher intraspecific competition 
lowers the frequency. As a consequence the frequency of both peaks will oscillate 
around 50% and none of them will be eradicated. 

The simulations also show the synergistic effect of competition and assorta- 
tivity. Speciation in fact, is possible also in the presence of a very large mating 
range such as A = 4 {A = 10) if we set a sufficiently strong competition: J = 2, 
R = 4. For the sake of comparison, consider that with A = 1 (A = 13) it is 
sufficient to set J = 1, R = 2. The evolutionary dynamics of the simulation with 
A — 4 (A = 10), J = 2, R = 4 can be followed if we plot mean, variance and 
mean fitness of the distribution. At the beginning of the run the initial distribu- 
tion (we chose a delta peak in x = 0 but the results are not dependent on the 
initial distribution) splits in two bell-slraped distributions at the opposite ends 
of the phenotypic space. This corresponds to an abrupt increase of the mean 
and variance as well as of the average fitness: if the population is distributed 
over several phenotypes, the competition pressure is relieved. After t = r the 
mutation rate decreases and the bell-shaped distributions are narrowed first in 
the regions pointing towards the center of the space (which leads to a further 
little increase in variance and mean fitness) and then in the regions pointing 
towards the ends of the space (so that the variance decreases again). The final 
distribution is represented by two delta peaks each accounting for about 50% 
of the population. The plots of mean, variance and mean fitness are shown in 
Figure 1. 




412 F. Bagnoli and C. Guardiani 



-0.2 
_ - 0.3 

r °- 4 

: - 0.5 
-0.6 
- 0.7 
-0.8 
- 0.9 






15000 20000 

generations 





j 


' Mean 

Vanance 




ti>r- 

















10000 15000 20000 25000 

generations 



Fig. 1. Average fitness, variance and mean of the population distribution in a regime of 
weak assortativity (A = 4 or A = 10), high competition intensity and long competition 
range (J = 2, a = 10, R = 4). Annealing parameters: fj, o = 10 _1 , /Too = 10 — 6 , 
t = 10000, 8 = 3000. Total evolution time: 30000 generations. 



In order to perform systematic simulations in a wide range of values of 
competition and assortativity, a suitable parameter must be chosen to moni- 
tor the speciation process. We chose the variance of the distribution: var = 
— x) 2 p(xi). The simulations, in fact, show that, as competition and/or as- 
sortativity is increased, the frequency distribution first widens, then it becomes 
bimodal and eventually it splits in two sharp peaks that move in the phenotypic 
spaceso as to maximize their reciprocal distance: each of these steps involves an 
increase in the variance of the frequency distribution. 

As the simulations discussed so far refer to a flat static fitness landscape, for 
the sake of completeness, we will now show the plot of variance as a function of 
competition and assortativity in the case of steep static fitness. We graphically 
illustrate the synergistic effect of competition and assortativity on speciation in 
the contour plots shown in Figure 2. 

The contour plots divide the J, A plane in two regions: the area on the left 
represents the state with a single quasi-species whereas the area on the right 
represents the state with two or more distinct quasi-species. If a point, owing to 
a change in competition and/or assortativity, crosses these borderlines moving 
from the first to the second region, a speciation event does occur. It should be 
noted that in the high competition region, the contour plots tend to diverge from 
each other showing a gradual increase of the variance of the frequency distribu- 
tions. This is due to the fact that, even if competition alone is not sufficient to 
induce speciation in recombinant populations, it spreads the frequency distri- 
bution that becomes wider and wider, and splits into two distinct species only 
for extremely high assortativity values. In this regime of high competition only 
the ends of the mating range of a phenotype x are populated and the crossings 
between these comparatively different individuals will create once again the in- 
termediate phenotypes preventing speciation until assortativity becomes almost 
maximal. The down sloping shape of the contour plots is a strong indication 
of a synergistic interaction of competition and assortativity on the speciation 
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Fig. 2. Contour plots var = 5, 10, 15, 20, 25, 30, 35 for steep static fitness land scape. 
Parameters: (5 = 1, T = 10, a = 2, R = 4, p = 0.5. Annealing parameters: // o = 10~ 1 , 
IXm = 10 — 6 , r = 10000, 5 = 3000. Total evolution time: 30000 generations. Each point 
of the plot is the average of 10 independent runs. 



process. The contour plots show that for moderate competition there is a syner- 
gistic effect between competition and assortativity since a simultaneous increase 
of J and A may allow the crossing of the borderline whereas the increase of a 
parameter at a time does not. On the other hand, for larger values of J the phase 
diagram shows a reentrant character due to the extinction of one of the species 
that cannot move farther apart from the other and therefore cannot relieve com- 
petition anymore. It can also be noticed that for J = 0 the contour plot shows 
a change in slope due to extinction of one species owing to random fluctuations. 



4 Conclusions 

We presented a microscopic model to investigate the evolution of sympatric spe- 
ciation. The model is equivalent to the mean field approximation of a probabilis- 
tic cellular automaton with a very large number of states. The simulations were 
speeded up through the use of a simulated annealing technique. We showed that 
assortativity alone is sufficient to induce speciation, but competition is necessary 
for the stable coexistence of the new species. Moreover, competition and assor- 
tativity influence speciation in a synergystic way as is evident from the down 
sloping contour lines of the plot of variance as a function of competition and 
assortativity. The contour plots also show that speciation can be seen as a phase 
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transition and the system shifts from a state with a single species to a state with 
two or several species as a threshold of competition and assortativity is crossed. 
Finally, it is quite interesting to observe the behavior of the average of the fitness 
landscape H with respect to time in the various simulations. In general there is a 
large variation in correspondence of the variation of the mutation rate. One can 
observe that there are cases in which H increases smoothly when decreasing /x, 
while others, in correspondence of weak competition and moderate assortativity 
exhibit a sudden decrease, often coupled to further oscillations. At the micro- 
scopic level, this decrease corresponds to extinction of small inter-populations 
that lowered the competition levels. 

As explicitely declared, the present work focuses on the mean-field approxi- 
mation of a Cellular Automaton. Simulations of the CA with finite neighborhood 
are currently in progress, and the first results will be presented elsewhere. It is 
also our intention to study the evolutionary dynamics in the case of randomly 
chosen connections between interacting individuals. 
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Abstract. A Cellular Automata methodological approach for modelling large 
scale (extended for kilometres) surface flows (e.g. lava flows, debris flows etc.) 
is here presented. It represents an improvement to previous empirical 
approaches, in order to obtain a more physical description of the phenomenon 
and a more accurate control of its development. The flows are described as 
interacting “blocks” individuated by their mass centre position and velocity 
inside each cell. Such an approach was applied to lava flows; an event of the 
last (2002) Etnean eruption was simulated with relevant results. 



1 Introduction 

Cellular Automata (CA) applications to fluid-dynamics generated two important 
computational paradigms: lattice gas automata models [1] and hence the more robust 
lattice Boltzmann method [2], [3], This latter is based on microscopic models and 
mesoscopic kinetic equations, which rule a discrete world, where the fluid is 
compound by “fluid particles” moving in a regular lattice. This idealised represen- 
tation has been shown to be sufficient to recover complex features of the fluid- 
dynamics in many phenomena. Nevertheless many complex macroscopic fluid- 
dynamical phenomena seem difficult to be modelled in these CA frames, because they 
take place on a large space scale. Therefore, they force to use a macroscopic level of 
description that could involve the management of a large amount of data, e.g., the 
morphological data. Moreover, they could even imply complicated transition 
functions. 

Many attempts were made in the past for simulating surface flows with CA by our 
research group. Three-dimensional CA (cubic cells) models with few states were 
developed for lava flows [4]; they gave qualitatively sounding simulations, but 
applications to real cases were effective only for simplest cases, also regarding the 
longest computation duration (at that time). 

In order to overcome these difficulties, advantage was taken by the superficial 
nature of the lava flows: quantities concerning the third dimension, the height, may be 
easily included into the states of the CA, permitting models in two dimensions. It 
involves a large amount of states, which can be formally represented by means of 
substates: the Cartesian product of the sets of all substates constitutes the set of states. 
Here a substate specifies an important characteristic (e.g. altitude, temperature etc.) to 
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be attributed to the state of the cell and necessary for determining the CA evolution. 
Furthermore, an empirical algorithm was developed in this context in order to 
determine the outflows from a cell toward the remaining cells of its neighbourhood, 
giving rise to the model SCIARA and permitting first simulations of complex Etnean 
lava flows [5], 

This empirical method was formalised in [6] introducing new features; the good 
results encouraged further applications to the debris/mud flows [7], [8], [9] and to the 
soil erosion by water flows produced by rain [10]. 

This approach has a significant restriction, because it doesn’t permit to make 
velocity explicit: a fluid amount moves from a cell to another one in a CA step (which 
corresponds usually to a constant time), it implies a constant “velocity” in the CA 
context of discrete space/time. Nevertheless, velocities can be deduced by analysing 
the global behaviour of the system in time and space. In such models, the flow 
velocity emerges by averaging on the space (i.e. considering clusters of cells) or by 
averaging on the time (e.g. considering the average velocity of the advancing flow 
front in a sequence of CA steps). Note that such a restriction is typical of the CA, 
including lattice gas automata and lattice Boltzmann models. 

Constant velocity could be a limit for modelling finely macroscopic phenomena, 
because it is difficult to introduce physical considerations in the modelling at a local 
level. Furthermore, the time corresponding to a step of the CA is often deduced “a 
posteriori” by the simulation results and parameters of the transition function must be 
again verified when the size of the cell is changed. 

A solution is here proposed in the CA transition function: firstly, moving Hows 
toward the neighbour cells are computed, they are individuated by their mass and 
mass centre (i.e., the barycentre), to whom a velocity is attributed; secondly, the 
resulting new mass, mass centre and velocity are computed by composition of all the 
“interacting blocks inside the cell”, defined as all the inflows from the neighbours and 
all the outflows, whose mass centre is inside the cell. 

Such a method is presented in the second section, the third section defines the 
applications in modelling lava flows, the simulation results of the 2002 Etnean lava 
flows are shown in the fourth section, some conclusions are reported at the end. 



2 The Methodological Approach 

Classical (homogeneous) CA [11] are based on a regular division of the space in cells, 
each one embedding an identical finite automaton (fa), whose state accounts for the 
features attributed to the cell; S is the finite set of its states. The fa input is given by 
the states of neighbouring cells. The neighbourhood conditions are determined by a 
pattern of m cells, which is invariant in time and space. The /a have an identical state 
transition function vS m —>S, which is simultaneously applied to each cell. At time t=0, 
fa are in arbitrary states and the CA e volves changing the state of all fa simultaneously 
at discrete times (the CA steps), according to the transition function of the fa. 

CA formalisation must be further specified in order to account for some features of 
large scale surface flows (at least metres for the width, kilometres for the length). 
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2.1 Global Parameters 

Additional specifications need for permitting a correspondence between the system 
with its evolution in the physical space/time, on the one hand, and the model with the 
simulations in the cellular space/time, on the other hand. 

Primarily, the size of the cell p s (e.g. specified by the cell side) and the time 
correspondence to a CA step p t must be fixed. These are defined as “global 
parameters”, as their values are equal for all the cellular space. They constitute the set 
P together with other global parameters, which are commonly necessary for 
simulation purposes. 



2.2 Substates 

The state of the cell must account for all the characteristics, relative to the space 
portion corresponding to the cell, which are assumed to be relevant to the evolution of 
the system (e.g. the altitude). A substate is attributed to each characteristic. The 
possible values of the substate, referred to a characteristic ch (e.g. the possible values 
of the altitude), constitute the finite set S ch . 

The set S of the possible states of a cell is given by the Cartesian product of the sets 
of values S p S 2 ,...,S n of the substates: S=S I xS 2 X. xS n . 

When a characteristic (e.g. a physical quantity) is expressed as a continuous 
variable, a finite, but sufficient, number of significant digits are utilised, so that the set 
of permitted values is large but finite. 

The substate value is considered always constant inside the cell (e.g. the substate 
temperature). It implies a constraint for the global parameter p s “cell size” that must 
be chosen small enough so that the approximation to consider a single value for all the 
cell extension may be adequate to the features of the phenomenon to be modelled. 

The movement of a certain amount of fluid from a cell toward another cell is 
described introducing substates of type “outflow quantity”, “outflow mass centre”, 
“outflow velocity”. 

The substate “outflow velocity” impose a constraint on the global parameter p t 
because the shift, determined by the maximum possible velocity of the outflow in the 
time corresponding to a CA step, should not exceed the neighbourhood boundaries. 



2.3 Cellular Space 

The cellular space is two-dimensional because quantities concerning the third 
dimension, the height, may be included among the substates of the cell in a superficial 
phenomenon (e.g. the substates altitude, lava thickness). A regular hexagonal 
tessellation is chosen, because it minimises the spurious symmetries effect in 
comparison with triangular and square tessellation. 
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2.4 Elementary Processes 

The state transition function T must account for all the processes (physical, chemical, 
etc.), which are assumed to be relevant to the system evolution, which is specified in 
terms of changes in the states values of the CA space. As well as the state of the cell 
can be decomposed in substates, the transition function r may be split into 
“elementary” processes, defined by the functions <7/, (7i . . . . cr with/; being the number 
of the elementary processes. 

The elementary processes are applied sequentially according a defined order. Each 
elementary process involves the update of the states of the cells. The application of all 
the elementary processes constitutes a CA step. 

Different elementary processes may involve different neighbourhoods; the 
resulting CA neighbourhood is given by the union of all the neighbourhoods 
associated to each process. 



2.5 External Influences 

Sometimes, a kind of input from the "external world" to the cells of the CA must be 
considered; it accounts for describing an external influence which cannot be described 
in terms of local rules (e.g., the lava alimentation at the vents, i.e.the craters) or for 
some kind of probabilistic approach to the phenomenon. Of course special and/or 
additional functions must be given for that type of cells. 



2.6 Outflows Computation 

Outflows computation is performed in two steps: determination of the outflows 
minimising the “height” differences in the neighbourhood [6] and determination of 
the shift of the outflows. 

The neighbourhood of a cell (the “central” cell) is considered. It consists of the 
central cell (index 0) and its six adjacent cells (indexes 1, 2 ... 6). 

A height quantity, that involves substates concerning the third dimension, is 
defined for each cell according to the particular type of phenomenon (e.g., altitude 
plus lava thickness). 

The height of the central cell is divided in two parts: the fixed or bound part (£>[0]) 
and the movable part (m[0]). The movable part represents a quantity that could be 
distributed to the adjacent cells (e.g., the lava in terms of thickness), whilst the bound 
part cannot change value (e.g.. the altitude). There is no movable part for the adjacent 
cells (m[i]=0, l<i<6 , b[i]^0, l<i<6 ) because only the mobile part of the central cell 
may be distributed. 

The flow from the central cell to the i-th neighbouring cell will be denoted by /[/] , 
0<?<8, where f[0] is the part of m[0 ] which is not distributed. Let b'[i]=b[i]+f[i], 
0<i<6 be the sum of the content of a neighbouring cell, plus the flow from the central 
cell, and let b'_min be the minimum value for b'[i\. 

Thus, the determination of outflows, from the central cell to the adjacent cells, is 
based on the local minimisation of the differences in “height”, given by the following 
expression: 
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n - 1 

^ ( b' [i ]-b'_ min ) (1) 

i = 0 

The “minimisation” algorithm, i.e. the algorithm for the minimisation of dif- 
ferences, guarantees the mass conservation; its complete specification and correlated 
theorems can be found in [6] . 

The minimisation algorithm guarantees indeed the maximum possible equilibrium 
in the neighbourhood when the outflows will reach the adjacent cells, but the velocity 
of the outflows could be different because they also depend on gravity, slope and 
friction forces. 

A velocity is computed for each minimising flow /[;] according to the following 
equations (deduced in sequence and similar to the Stokes equations) where F is the 
force, m is the mass of /[/], g is the acceleration of gravity, v is the velocity of /[;'], t is 
the time, v 0 is the initial velocity, 9 is the angle of the slope between the central cell 
and the neighbour i, a is the friction parameter. 

F = mg sin 9 —a m v 

dv/dt = g sin 9- av (2) 

v = (v 0 - g sin 9/ a) e a + (g sin 9/ a) 

These equations describe a motion, which is depending on the gravity force and is 
opposed by friction forces. An asymptotic velocity limit is considered because the 
effect of the friction forces increases as the velocity increases. 

This formal apparatus permits to describe the outflow from a cell c’ towards its 
neighbour c” as the shift of a “block” (i.e, /[(]), individuated by its mass, mass centre 
position and velocity. Such a shift couldn’t be sufficient to locate the new position of 
the block mass centre outside the central cell c’: so more than one step could be 
necessary for a transfer of the block to the cell c”. 



3 The SCIARA y2b Model for Lava Flows 

Navier-Stokes equations represent a good, but perhaps not completely exhaustive 
approach in terms of systems of differential equations for accounting of physical 
behaviour of lava flows, moreover such equations become more complicated because 
lava flows can range. Theologically, from nearly Newtonian fluids to brittle solids by 
cooling [12]. Numerical methods look difficult to be applied efficaciously. So 
different CA approaches were followed by our research group [4], [5], [6], [13] and by 
other research groups: 

Young and Wadge [14] presented a clever simulation CA model, but applicable 
only to simple flow fronts. 

Miyamoto and Sasaki [15] developed a CA model for non-isothermal laminar 
Bingham flows and validated it on real lava flows at Mt. Miyakejima; they solved the 
problem of the spurious symmetries with a probabilistic method, obtaining positive 
results in their simulations. 
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3.1 Main Specifications of the Model SCIARA 72 b 

SCIARA y2b is the last version of the family of CA models for lava flows SCIARA 
( Simulation by Cellular Interactive Automata of the Rheology of Aetnean lava flows', 
the acronym was devised by the word used for the solidified lava path in Sicilian), it 
improves the previous versions introducing outflows as blocks. 

SCIARA 72 b = <R, V, X,S,P,t;0>: 

R = \(x, y)\ x,yeN, 0<x<l x , 0<y<l y } identifies the set of regular hexagons covering the 
finite region, where the phenomenon evolves. ./Vis the set of natural numbers; 

V is the set of cells, corresponding to the area of craters; 

X is the hexagonal neighbourhood; 

S is the finite set of states of the /a; S = S a xS lh xS x xS y xS T xS Fth 6 xS Fx xS Fy xS Fx \ 

S a is the cell altitude; 

S th is the thickness of lava inside the cell; 

S x , S are the lava mass centre co-ordinates x and y, referred to the cell centre; 

S T is the lava temperature; 

S Fth is the lava flow, expressed as thickness (six components); 

S Fv is the velocity of the flow attributed to the mass centre (six components); 

S Fx , S F are the co-ordinates x and y of the flow mass centre (six components); 

P is the set of the global parameters; P={p a , p step , p TV , p TS , p a[lhv , p adhS , p^, P v[ }; 
p is the apothem of the hexagonal cell; 
p is the temporal correspondence of a step of SCIARA; 
p ,p are the lava temperature at the vent and the solidification temperature; 
p is the cooling parameter; 

p is the “asymptotic limit of velocity” for lava flows; 

p , p are the adherence value at the vent and at the solidification temperature 
(the adherence is the lava thickness, that may not be removed) 

T:S 7 ->S is the transition function, composed by four “elementary” processes: 

<7 fl determinates the lava outflows by application of minimisation algorithm; 

<7 sh determinates the lava outflows shift by application of velocity formulae; 

<7 mix computes the mixing of the blocks: remaining lava and inflows inside the cell; 
a cool computes the lava cooling by radiation effect and solidification effects; 

([> :S lh xN—>S llt xS T specifies the emitted lava from the V cells at the step teN. 



3.2 The SCIARA 72 b Transition Function 

At the beginning of the simulation, the S a substates assume the morphology values 
and all the remaining substates are initialised to zero. 

At each step, the function (f> is applied to the cells of V , determining the new lava 
thickness and attributing the value p to the substates S T , then the four elementary 
processes are applied in sequence, each one updates the states of all the cells in R. 

The “elementary” processes are specified in terms of equations, where names of 
parameters and substates are derived by the lower indexes specified in the section 3.1: 
the parameter p ltanie is indicated as name', name and nname are respectively referred to 
the substate of S mme and its new value; when the specification of the index ; of a 
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neighbourhood cell is necessary, the notation name[i\ is adopted; an index pair <i,j> 
is specified, if necessary, for the substates of type flow; Fname[i,j] means the value of 
the flow substate name from the cell i toward the cell j in the neighbourhood ( i=0 for 
the outflows, j=0 for the inflows, whose values are trivially deduced by the outflows). 

Determination of the Lava Flows. The resistance of lava increases as temperature 
decreases. Because of complexities which are inherent in specifying lava rheology 
and its variation with temperature [12]; rheological resistance was modelled in terms 
of an “adherence effect”, measured by adh, which represents the amount of lava 
(expressed as a thickness, as discussed above) that cannot flow out of a cell because 
of rheological resistance, adh is assumed to vary with temperature T according to a 
simple inverse exponential relation adh=ce ~ kr , where c and k are positive constants 
such that adhV=ce~ kTV and adhS=ce~ kTS . 

The minimisation algorithm is applied with the following specifications [5], [13]: 
m[0] =th[0] - adh , b[0\ = a[0] +adh, b[i] = a[i ] + th[i\, l<i<6. 

Determination of the Lava Outflow Shift. Etnean lava flows are of “aa” type [16]; 
therefore, their behaviour may be considered purely gravitational, so there is no 
dependence on the previous velocity: Fv[0,i] =vl-sinO, where the velocity limit vl=g/a 
and the angle 0 is computed according the slope of the ideal path that a flow could 
follow from the point with co-ordinates (x[0], y[0], a[0]+th[0]) of the central cell to 
the point with co-ordinates (0, 0, a[i]+th[i]) of the neighbour i. 

The projection of the shift of the flow mass centre on the plane x-y is given by: 
shift= vl-sinOcosO- p , it determines nFx[0,i\ and nFy[0,i]. If the shift is so long that 
the flow mass centre exceeds the boundary of the cell 0, the flow is attributed to the 
cell i, otherwise it remains in the cell 0. 

Interaction of Blocks inside the Cell. Interaction of blocks inside the cell consists in 
mixing of lava components i.e. inflows and lava remaining in the cell. The resulting 
new thickness is obtained by the summation of the thicknesses of all components, 
while the resulting new temperature and new co-ordinates x and y of mass centres are 
computed by the weighted average of these quantities on the thickness. Note that the 
outflows, whose mass centre remain inside the cell are considered as lava remaining 
in the cell, while those with mass centre outside the cell (i.e, inside the neighbour cell 
i) belong to the inflows of the neighbour cell i. 

Lava Cooling and Solidification. It is always assumed here that different flows in a 
cell can be thermally well mixed, although this approach involves sometimes a rough 
approximation. The physical formula applied with the same considerations of the 
other versions of SCIARA [5], [13] : 



nT = T /\j I + (t 3 ■ cool ■ step/th) 



(3) 
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4 Simulation of the Lava Flows of 2002 Etnean Eruption 

The event which occurred in the autumn of 2002 at Mount Etna (Sicily) involved two 
main lava flows. The former started on the NE flank of the volcano, with lava 
generated by a fracture between 2500 m a.s.l and 2350 m a.s.l., near the 1809 fracture 
and pointed towards the town of Linguaglossa and was exhausted after 8 days. 

Such real event (fig. 1) was generated by numerous vents originated from the 
“fracture of 1809” [17]; simulations were actually carried out by considering two 
major emission points, neglecting the others, since there was no significant interaction 
between these flows and the main lava stream. 




Fig. 1. The maximum extension of the 2002 Etnean lava flow of NE flank, menacing the town 
of Linguaglossa. 




Fig. 2. Simulation steps of 2002 Etnean lava flows in the NE flank, corresponding to the lava 
fields of the first, fourth and ninth day; maximum extension was reached before the ninth day. 

The Linguaglossa lava flow was chosen for validating SCIARA y2b for the more 
precise (but not optimal) available morphological data. Positions of craters are 
considered indisputable data, but lava flow rates are considered approximate for 
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obvious known reasons, i.e., the difficulty of measuring flow rates and evaluating 
eventual rapid changes. 

Many simulations with different values of global parameters were performed in 
order to obtain the better agreement between real and simulated event in terms of 
surface covered by the lava flows and of the lava flows advancing front. 

The following values obtained the best simulation (fig 2): p =2. 5m, p =5s, 

a step 

p =1373°K,o =1123°K,p =7.5-10' 1 V-m-°K' 3 ,p =0.9m,p =6m,p =2m/s. 

7V TS cool r adhV r adhS r vl 

The simulation captures the major features of the real event, both in terms of lava 
extent and in terms of covered distance; moreover field data concerning the advancing 
flow front fits well with the corresponding steps of simulation. 

Other simulations were made just doubling and halving p . Results were 
interesting because simulations were strictly similar: it shows that explicit velocity 
permits time scalability. This feature was not permitted in previous SC1ARA models, 
where time was deduced “a posteriori” on the evaluation basis of the simulation 
goodness. The same tests for p were not performed because of the imprecision of the 
available DEM (Digital Elevation Model). 

Improvements of this model concern two major aspects. Even if in the previous 
model the overall lava covered distance reflected the real event, the new model has 
permitted a time correspondence between lava extents at intermediate timings. The 
second improvement concerns the areal extent of the simulated lava flow. The 
evaluation of the goodness of a given simulation is performed by a evaluation 
function, e, simply comparing the extent of actual and simulated events (in terms of 
affected area). Let R and S be the regions affected by the real and simulated event, 
respectively; let m(A) denote the measure of the set A. The function e is defined as: 



I m(RnS) 

y m (RuS) 



(4) 



Note that ee [0, 1 ]. Its value is 0 if the real and simulated events are completely 
disjoint, being m(Rr^S)= 0; it is 1 in case of perfect overlap, being m(Rr)S)= in(RuS). 

Regarding the simulation here presented, <?=0.689, while the e value didn’t exceed 
0.65 in the best simulation of previous SCIARA models. 



5 Conclusions 



SCIARA y2b is not a sophisticated model from a physical viewpoint: formulae for 
explicit velocity are simple, they express the velocity limits, but they could be 
improved in order to capture better the rheological features of lava flows. The model 
permits also mixing of lava flows at very different temperatures, rotational effects are 
reduced to simple displacements between couples of cells, but it is more physical and 
less empirical in comparison with the previous versions of SCIARA. 

Simulations are valuable and developments are being planned for simulating more 
complex physical phenomena concerning lava flows (e.g. tunnel effects, lava layers at 
different temperature, etc.) or for extensions to other types of lava. 
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Abstract. We introduce a Cellular Potts model (a cellular-automaton- 
based Monte-Carlo model) of in vitro capillary development, or angiogen- 
esis. Our model derives from a recent continuum model, which assumes 
that vascular endothelial cells chemotactically attract each other. Our 
discrete model is “cell based.” Modeling the cells individually allows us 
to assign different physicochemical properties to each cell and to study 
how these properties affect the vascular pattern. Using the model, we 
assess the roles of intercellular adhesion, cell shape and chemoattrac- 
tant saturation in in vitro capillary development. We discuss how our 
computational model can serve as a tool for experimental biologists to 
“pre-test” hypotheses and to suggest new experiments. 



1 Introduction 

A detailed understanding of capillary blood vessel formation, or angiogenesis, 
is essential to understand and control physiological and pathological processes 
from wound healing to tumor growth and angiogenesis-related pathologies such 
as advanced diabetic nephropathy. Capillaries develop in two main ways: through 
the aggregation of endothelial cells into capillary cords, and through sprouting 
from existing capillaries. 

In vitro culture of human umbilical vascular endothelial cells (HUVEC) in 
Matrigel is a popular experimental model of capillary development (see e.g. [2]). 
Matrigel is an extracellular matrix product obtained from murine tumors. The 
extracellular proteins and growth factors in the gel stimulate HUVEC cells to 
form networks (Fig. 1) resembling vascular networks in vivo. HUVEC-Matrigel 
cultures and related HUVEC cultures in collagen [3] and fibrin [4] gels are stan- 
dard models in biomedical research to identify the key molecular players in 
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pathological and physiological angiogenesis, to unravel the role of cell growth 
and apoptosis in vasculogenesis and to probe potential anti-angiogenic pharma- 
ceuticals. 

The biophysical mechanisms by which endothelial cells form such networks 
are poorly understood. Several computational studies have suggested that en- 
dothelial cells, behaving according to a small set of rules, could suffice to ex- 
plain network-formation. Most hypotheses assume that either chemotaxis [5,6] 
or tractional forces relayed through the extracellular matrix [4,7,8] drive cell 
aggregation. Partial differential equation models of both mechanisms generate 
patterns that resemble in vitro networks, so which of them most closely parallels 
the biological mechanism remains unclear. 

These continuum approaches implicitly assume that the endothelial cells that 
construct the pattern are much smaller than the length-scale of interest, namely 
that of the “cords” and “nodes.” In vitro , however, at most a dozen elongating 
cells form the cords, which are one to a few cell-diameters thick (see Fig. 1). 
Thus, we expect the scale of the endothelial cells to play an important role in 
patterning. The properties of individual cells, such as their shape, their mu- 
tual adhesion and their adhesion to the extracellular matrix may also affect the 
pattern resulting from their interaction. 

This paper develops a cell-oriented approach to modeling in vitro vasculogen- 
esis, based on the Gamba-Serini chemotaxis model [5,6]. We construct computa- 
tional models of individual HUVEC cells which mimic experimentally-observed 
cell phenomenology, including response to external chemical signals, cell elon- 
gation, cell adhesion, chemotaxis, haptotaxis, etc. We then release these sim- 
ulated cells in a “virtual Petri-dish” and quantitatively compare their macro- 
scopic, collective behavior to experiments. We attempt to recover minimal sets 
of physicochemical properties and behavioral rules for the cells which reproduce 
the experimentally-observed tissue-level pattern. 

Our focus on cell phenomenology may seem crude compared to large-scale, 
post-genomic, systems-biology initiatives whose aim is to model ab initio and in 
detail the internal structure and genetic regulation of individual cells, and even- 



211 9h 

Fig. 1 . HUVEC culture on Matrigel, 4x magnification, 2h (left) and 9h (right). To 
improve contrast, we have applied an Opening Top-Hat transform [1] with a small-disk 
structuring element. 
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tually collections of interacting cells (see e.g. [9,10]). However, the cell-oriented 
approach allows us to distinguish a) how genomic information determines cell 
phenomenology (using detailed, single cell models) from b) how collections of 
cells exhibiting a particular phenomenology interact during biological morpho- 
genesis (using a “cell-oriented” approach). The cell-oriented approach has been 
successful in the study of biological morphogenesis. In particular, it can simu- 
late the entire developmental life cycle of the cellular slime mold Dictyostelium 
discoideum [11,12], convergent extension in early vertebrate embryos [13], tumor 
invasion [14] and pattern formation in vertebrate limb cell cultures [15]. 

The Cellular Potts model (CPM) [16,17], a cellular-automaton-based Monte- 
Carlo method, is a convenient cell-oriented simulation framework. Early CPM 
studies of differential-adhesion-driven cell rearrangement in cell aggregates quan- 
titatively reproduced cell sorting experiments [16] . An energy-minimization phi- 
losophy, a set of energy constraints and auxiliary conditions determine how cells 
move. Connections between cells determine a cell-cell contact (or bond) energy, 
favoring stronger over weaker bonds and shortening of the boundary length. An 
energy constraint regulates cell volume or area. Further constraints or auxiliary 
conditions easily extend the CPM to include clremotaxis [11], cell growth, cell 
death [18] and cell polarity [13]. 

The remainder of this paper introduces our hybrid CPM/PDE model and 
extensions to the CPM motivated by HUVEC phenomenology. We then study 
patterning in the CPM/PDE version of the Gamba-Serini clremotaxis model. 
Finally, we discuss the biological and computational relevance of our results. 

2 Methods 

2.1 Hybrid Cellular Potts Model 

We use a hybrid Cellular Potts (CPM) and partial differential equation (PDE) 
model (see [11,12,17]). The CPM models endothelial cells, while the PDEs model 
the chemoattractant. Experimentally-confirmed cell behaviors, which we include 
in the model are, 1) secretion of chemoattractants, 2) clremotaxis of cells up 
chemical gradients, and 3) progressive elongation of cells. 

The CPM represents biological cells as patches of lattice sites, x, with iden- 
tical indices cr(x), where each index identifies, or “labels” a single biological 
cell. Connections between neighboring lattice sites of unlike index a(x) ^ cr(x') 
represent membrane bonds, with a characteristic bond energy , , where we 

assume that the types and numbers of “cell adhesion molecules” (CAMs) of the 
interacting cells determine J. An energy penalty increasing with the cell’s devi- 
ation from a designated target volume A a imposes a volume constraint on the 
biological cells. To mimic cytoskeletally driven membrane fluctuations, we ran- 
domly choose a lattice site, x, and attempt to copy its index a x into a randomly 
chosen neighboring lattice site x'. To ensure isotropy, we use the twenty, first- to 
fourth-order neighbours on a square lattice. On average, we attempt an update 
at each lattice site once per Monte-Carlo step (MCS). We calculate how much 
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the Hamiltonian would change if we performed the copy, and accept the attempt 
with probability: 



P{AH) = {exp {-{AH + H 0 )/T),AH > -H 0 - 1, AH < -H 0 }, (1) 

where H 0 > 0 is an energy threshold which models viscous dissipation and energy 
loss during bond breakage and formation[18]. We then define the Hamiltonian 
as: 



H — ^2 — ^<Ta,,<7„/) + — A a ) 2 , (2) 

x,x' O’ 

where A represents resistance to compression, and the Kronecker delta is 8 XtV = 
{l,x = y; 0, x y}. The cells reside in a “medium” which is a generalized 
CPM cell without a volume constraint and with a = 0. In most simulations, 
we use a bond energy J cc = 5 between the endothelial cells, and J c m = 20 
between the endothelial cells and the medium. We further define a surface tension 
JcM = JcM — J cc /2, which enables us to determine whether the cells cohere 
(7 C M > 0) or dissociate (y C M < 0) [16] in the absence of chemotaxis. Our default 
cellular adhesion setting is adhesive, i.e. y C M > 0. We define a special, high cell- 
border energy J c b = 100 to prevent cells from adhering to the boundaries. The 
viscous dissipation Hq and all terms in the Hamiltonian, i.e. the bond energies 
J, and the prefactors to the additional energy terms, such as A, scale with the 
temperature T; i.e. if we multiply T by a factor r, we can multiply the H 0 and 
the Hamiltonian by the same factor and obtain the same simulation results. 

In analogy to the Gamba and Serini PDE model [5,6], we set the diffusion 
and secretion of the chemoattractant c to: 



^=«^,o-(l-^,o)£C + f)V 2 c, (3) 

where 8^,0 = 1 inside the cells, a is the rate at which the cells release chemoat- 
tractant, and e is the decay rate of the chemoattractant. Thus, every site within 
the cells secretes the chemoattractant, which decays only in the medium. We 
solve the PDEs numerically using a finite difference scheme on a lattice match- 
ing the CPM lattice, using 20 steps per MCS with At = 0.2. For these parameters 
the chemoattractant diffuses much more rapidly than the cells, enabling us to 
ignore the advection that occurs as the cells push the medium forward. 



Chemotaxis. We implement preferential motion of the cells along gradients of 
the chemoattractant c by defining [11]: 






X 



c(x, t) 



s c(x, t) + 1 



(1 - 



(4) 




Cell-Oriented Modeling of In Vitro Capillary Development 429 



where \ is the strength of the clremotactic response, and the saturation s sets 
the Michaelis-Menten constant of the clremotactic response, for which we use 
s = 0.01 by default. For s = 0 the cell’s response to gradients does not level off at 
higher concentrations, as in the original method of Savill et al. [11], while for large 
s the cells become unresponsive. Because the chemical field biases cells’ boundary 
copying, each cell moves with an velocity v on \fil-i) ]>T E-,: Pj(di), where the 
sums run over the sites of the cell and the twenty lattice directions respectively, 
and Pj(di) denotes the probability that site j copies in lattice direction dt. The 
prefactor yj J l has units of force, but we can also interpret it as the clremotactic 
strain field [19]. 



Cell elongation. The HUVEC cells in our cultures elongate progressively as 
the vessel- like pattern develops. To study how cell elongation affects the pattern, 
we add a cell-length constraint to the Hamiltonian: 



H" = H' + X l (1-L) 2 , (5) 

where l is the length of the cell along its longest axis, L is its target length, and 
\l is the strength of the length constraint. Larger values of A l result in more 
elongated cells. Following Zajac et al. [20], we calculate the short and long axes 
of the moments of inertia as: 



ixx — Ei(y* v) > 

hy = - T,i( x i ~ x )(Vi ~ V)> 

lyx — J-xyi 

I yy = E i(Xi- x ) 2 , (6) 

where the sum is over all the sites x = ( Xi,yi ) in the cell. We determine the 
length l from the largest eigenvalue of I as l = 2y^Xb, where: 

Ab = 2^ xx lyy') 2 \J xx ~ lyy) 2 + ^xy (7) 

We can update the inertia tensor locally after each cell extension or retrac- 
tion, by constructing it from the first and second order moments of the positions 
of the sites the cells occupy: 




Ei A 
Ei X iVi 

E 



i(Ei^) 2 > 

\ E< *> Ei 

i(Ei2/*) 2 , 



(8) 



with a the cell area. Thus Eq. 7 determines l without lengthy calculation. 
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The length constraint can cause cells to split into disconnected patches. We 
prevent this artifact by introducing a connectivity constraint. To check whether 
an index copy into site x will change local connectivity, we count how many of 
its neighbors x'i have equal index to x while the next in cyclic order has unequal 
spin. If this quantity E; - ( ^(*),a( a! ' i+1 )) > 2 > with the sum run- 

ning in cyclic order, changing the site will destroy the local connectivity. We have 
currenly only implemented this algorithm for second-order neigborlroods ( i.e . 8 
neighbours) . This test for connectivity loss gives “false positives” for non-convex 
patches, which occur in our simulations when cells suddenly shrink, for example 
during “apoptosis” [18]. Instead of introducing a very expensive ( 0(N 2 )) global 
connectivity test, we tolerate temporary violations of connectivity for large en- 
ergy drops. The bond energies ensure that the lifetime of disconnected patches 
is short. We assume that cell fragmentation is energetically costly, and set a 
high energy threshold H 0 (typically H 0 = 500) iff the attempted update would 
change the local connectivity. 



2.2 Simulation Set-Up and Initialization 

In our partial differential equations (PDE), we set boundary values to zero (i.e. 
the boundaries absorb the chemoattractant), while for the CPM we implement 
repellent boundary conditions by setting the cell-boundary binding energy to 
J C B = 100. We use a 500 x 500 lattice, where each lattice site represents an area 
of about 4/im 2 . Cells have actual areas of around 45 lattice sites, equivalent to a 
typical endothelial cell area of around 150 [imf [21]. To diminish boundary effects 
we initially disperse n cells randomly within a centered 333 x 333 square. For 
n >= 3000 we disperse the cells all over the field to minimize cell overlap. After 
each Monte Carlo Step (MCS) we update the PDE 20 times with At = 0.2. 



2.3 Morphometry 

We characterize the patterns by measuring the average size La of the open spaces 
which the vessel-like structures enclose (lacunae). La = 1/N E,; La j, with N the 
number of lacunae and La; the size of each lacuna, which we identify using a 
standard connected-component labeling algorithm [1], 

3 Results 

Figs. 2 and 3 review our results. Fig. 2a reproduces the results of the Gamba 
and Serini PDE model [5,6] in our cell-oriented model. The cell-adhesion settings 
are strongly adhesive (y C M = 17.5). The cells aggregate into cords and nodes, 
typical of HUVEC Matrigel cultures (Fig. 1) and the Gamba-Serini model. As 
the cell density increases, we find a percolation transition as in the Gamba- 
Serini model [5]; for low densities, n < 1000, the cells do not form a connected 
network, while for high densities, n > 1000, all cells interconnect (see Fig. 2a). 
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Fig. 2. Overview of parameter sweeps. Typical cell patterns after 5000 MCS. Lattice 
size 500 x 500, n = 1000 scattered within inner 333 x 333 subfield (e: n = 555). 
Parameters were T = 50, Aa = 50, A = 50, 7 c m = 17.5, s = 0.01, x = 2000, a = 0.01, 
e = 0.05, D = 1, Xl = 0 (e: A l = 1.0), unless indicated otherwise. 



Also, as in the Gamba-Serini model, the size of the lacunae depends on the rate 
of chemoattractant decay, as Fig. 2b shows. 

In Figs. 2c-e, we systematically modify the Gamba-Serini model by chang- 
ing the biophysical properties of the individual cells. First, we study the role 
of intercellular adhesion. In our model, cell adhesion is essential. If we reduce 
the adhesivity of the endothelial cells, a network no longer forms and the cells 
aggregate into “islands” (Fig. 2c). Like the Gamba-Serini model, our model can 
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Fig. 3. Dependence of lacuna size on model parameters, after 5000 Monte Carlo steps. 
Parameters as in Fig. 2. Error bars indicate standard deviations ( n = 5) 



also form networks without cell adhesion, but these networks are unstable and 
decay into an “island-like” pattern (results not shown). 

Fig. 2d shows the effect of changes in s, the threshold for chemotaxis satu- 
ration (sensitivity) (see eq. 4). Up to around s = 0.1 vessel- like patterns form, 
while for larger values more amorphous patterns with larger clusters and few 
lacunae form. The lacuna size becomes more variable for larger values of s (see 
Fig. 3d). 

In the in vitro culture experiments, the endothelial cells elongate during 
vessel-like patterning. Although elongation is not necessary for vessel-like pat- 
terning in our model, cell elongation is one of the first observable signs of vessel- 
like patterning in HUVEC cultures. In Fig. 2e and Fig 3e we investigate the 
effect of cell elongation on patterning at low cell densities (n = 555). Cell elon- 
gation facilitates the interconnection of isolated parts of the pattern (see Fig. 3e). 
For very large length constraints ( L > 300) the cells fragment, after which the 
vessel-like pattern no longer interconnects. At higher densities, when the pattern 
already fully interconnects even for round cells, cell elongation has no effect on 
lacuna size, but affects the shape of the lacunae (results not shown) . 

4 Discussion and Conclusions 

Computational screening of the parameter dependence of patterning may help 
to direct experiments to identify the key regulators of vascular development and 
suggest new hypotheses. If chemotaxis drives vascular development, as in the 
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Gamba-Serini model, our simulations suggest that endothelial cell adhesion is 
essential to form stable vascular networks. Furthermore, our simulations suggest 
that pattern formation changes qualitatively when the receptors saturate at lower 
concentrations of chemoattractant. We could test this effect experimentally by 
specifically inhibiting the production of receptors, in which case the remaining 
receptors would saturate at lower concentrations, or by adding receptors with 
higher affinity. Cell extension also strongly affects the multicellular pattern, re- 
ducing the cell density network formation requires. Biological experiments might 
attempt to specifically target the elongation of the endothelial cells. Preliminary 
results also show that cell motility above a mininum required velocity does not 
significantly affect patterning (not shown). Continuous models have great diffi- 
culty assessing the role of these parameters, in particular of intercellular adhesion 
and cell morphology. 

In our ongoing work we are refining and validating our computational model 
by obtaining experimentally derived values for the model parameters and by 
comparing our simulation results quantitatively to time-lapse videomicroscopy 
experiments. We are analyzing the dynamics of patterning in HUVEC cultures 
under different physiological and pathological conditions and comparing these 
to the computational model’s dynamics for similar simulated conditions. Grad- 
ual cell elongation, rather than the instantaneous elongation that we employed 
here, may change the model’s dynamics. We also plan to model the interaction of 
the individual cells with the viscoelastic extracellular matrix, which experiments 
and theory [4] suggest is crucial to in vitro vasculogenesis. These studies may 
lead to better understanding of the biophysical mechanisms of vascular develop- 
ment, which will suggest new experimental tests of the genetic and biophysical 
regulation of vascular development. 
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Abstract. We outline the basic principles of neuropercolation, a gener- 
alized percolation model motivated by the dynamical properties of the 
neuropil, the densely interconnected neural tissue structure in the cortex. 
We apply the mathematical theory of percolation in lattices to analyze 
chaotic dynamical memories and their related phase transitions. This 
approach has several advantages, including the natural introduction of 
noise that is necessary for system stability, a greater degree of biological 
plausibility, a more uniform and simpler model description, and a more 
solid theoretical foundation for neural modeling. Critical phenomena and 
scaling properties of a class of random cellular automata (RCA) are stud- 
ied on the lattice 7Z 2 . In addition to RCA, we study phase transitions 
in mean-field models, as well as in models with axonal, non-local inter- 
actions. Relationship to the Ising universality class and to Toom cellular 
automata is thoroughly analyzed. 



1 Introduction 

Information encoding in the form of oscillatory patterns has advantage com- 
pared to fixed-point (static) memories. A number of researchers acknowledged 
the limitation of static approaches, and have worked around it. Grossberg has 
proposed adaptive resonance theory network which is a sophisticated topological 
arrangement of components [1]. Hopfield proposed his famous associative mem- 
ory network, which is a recurrent system [2], Elman uses processing elements 
with feedback as memory devices [3]. Cellular nonlinear neural network (CNN) 
represent another paradigm showing great promise by implementing cellular au- 
tomata theory in the form of a programmable memory device [4] . In a separate 
development, Kill nets have been introduced which represent a prototype of 
dynamic memory devices based on information encoding in aperiodic (chaotic) 
attractors [5]. 

* This research has been funded in part by NSF BITS Grant #0130352. 
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What distinguishes brain chaos from other kinds is the filamentous texture 
of neural tissue called neuropil, which is unlike any other substance in the known 
universe [6, 7]. Neural populations stem ontogenetically in embryos from aggre- 
gates of neurons that grow axons and dendrites and form synaptic connections 
of steadily increasing density. At some threshold the density allows neurons to 
transmit more pulses than they receive, so that an aggregate undergoes a state 
transition from a zero point attractor to a non-zero point attractor, thereby 
becoming a population. Such a property has been described mathematically in 
random graphs, where the connectivity density is an order parameter that can 
instantiate state transitions [8, 9]. 

Information processing in the proposed chaotic dynamical memories is closely 
related to percolation phenomena [10, 7, 11]. For about forty years now, perco- 
lation theory has been an active area of research at the interface of probability 
theory, combinatorics and physics [12]. Interest in various aspects of standard 
percolation remains high, including estimates of critical probabilities [13, 14]. 
There has been much work on the family of processes know as bootstrap per- 
colation [15 - 19]. Many percolation problems exhibit phase transitions, where 
for p less than some critical probability p cr it only finite clusters exist, and for 
p > Pcrit infinite clusters almost surely exist. We propose to apply the math- 
ematical theory of percolation in lattices to analyze dynamical neural systems 
and their related phase transitions. First the basics of bootstrap percolations are 
described. This is followed by the introduction of several generalizations of stan- 
dard percolation, which we call neuropercolation. Generalizations include the 
biologically motivated activation and deactivation functions, random transition 
rules, and non-local interactions. Non-local interactions mean a rewiring of the 
connections, which is related to small- world phenomena [20]. Related work on 
percolations in small-world models is described in [21, 22]. 

First we describe theoretical results on mean field models, and critical behav- 
ior observed in local RCAs. This is followed by the analysis of non-local models 
and the comparison of their behavior with Ising universality class and Toom 
automata [23]. Finally, we discuss the directions of combining populations of 
nonlinear processing elements to multi-layer architecture, in order to describe 
dynamic processes in cortices. 



2 Model Description and Activation Density 

Here we introduce the mathematical formalism of neuropercolation. A key pa- 
rameter of the system is the probability p, which characterizes the stochastic 
nature of the transition rule. We will see that the random initialization is of 
secondary importance when having random transition rule. I the following con- 
siderations we describe effect related to this probability p. 

Let A be a finite subset of 7Z d containing the origin 0 = ( 0 , . . . , 0 ) and let 
p: 2 a [0, 1] be a function that assigns for each subset S C A a probability ps- 
We define a stochastic process as a sequence of subsets X t C 7Z d by including 
each x = (xi,...,Xd) in X t+ \ independently with probability ps where S = 
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A fl (X t — x) and X t — x is the translate of X t by — x. We start the process 
with Xq given by some specified distribution. We call such a process a random 
cellular automaton on 7Z d . Usually we define A to be the closed neighborhood of 
z, consisting of all points unit distance from z together with z itself. Similarly we 
can define random cellular automata on the d-dimensional torus 7Z n x x . . . x 7Z nd , 
or on the corresponding finite grid (with suitable boundary conditions imposed) . 
We start by considering p of a certain special form. Assume ps depends only on 
the cardinality of S and whether or not the site itself is active (z £ S). We write 
pi- 0 '* in place of pg when |5| = r and z ^ S and p ^ when 151 = r and z £ S. 
We shall call these models isotropic. Isotropic models are substantially more 
restrictive than the general case, but they still have complex behavior. We call 
the model fully isotropic if p^ = pi 1 ' 1 = p r for all r. In this case, the site itself 
is treated on the same basis as its neighbors. By complementing the sets X t we 
obtain a new model with p'[ l> = 1— (or more generally, p' s = 1 ~Pa\s)- We 

call the model symmetric if it is the same as its complement, so p^ = 1 — 
for all r (or ps = 1— pa\s for non-isotropic models). For symmetric models, there 
is a further symmetry. If we complement every other X t , then this is equivalent to 
a new symmetric model with pg replaced by p* s = 1 — pg. We call this new model 
the dual model of a symmetric model ( pg ). Phases in the original model with 
density not equal to 0.5 will give oscillating phases in the new model with period 
two [24]. Random cellular automata generalize deterministic cellular automata 
such as Conway’s game of life. Given the complexity of deterministic automata it 
should not come as a surprise that random automata display extremely complex 
behavior. 



3 Mean Field Models 

On a finite grid or torus, we can compare random automata with the correspond- 
ing mean field model over 7Z 2 . In the mean field model instead of taking |A| — 1 
specified neighbors, we take \ A\ — 1 elements of the grid at random (with replace- 
ment) . It is clear that the mean field model does not depend on the topology of 
the grid, and the only information of relevance in X t is given by its cardinality 
\X t \. We define x t to be \X t \/N where N is the size of the finite grid or torus. 
Thus Xt £ [0, 1] gives the density of points in X t . Let us consider the symmetric 
fully isotropic model Po — Pi — P 2 — P- It can be shown that for the mean field 
model, this has one fixed point at x = 0.5 for p £ [^, |], but for p < ^ = .2333 
the fixed point x = 0.5 is unstable and there are two other fixed points which are 
stable [24]. Figure 1 illustrates the average density and its variance for various 
p values. 

Our real interest lies in random cellular automata on 7Z l rather than the 
mean field models. So one question is how well do the mean field models approx- 
imate the corresponding random automata? Unfortunately, the answer is ‘not 
very well in general’. Since the mean field model is equivalent to that of a fully 
isotropic model, the best we could hope for is that it approximates fully isotropic 
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models. However, even fully isotropic models can exhibit behavior that is richer 
than that of the mean field models. The theoretical analysis of local RCA is 
extremely difficult. Recently, a rigorous mathematical derivation of properties of 
phase transitions in RCA with very small p ss 0 has been given by [25]. Phase 
transitions near the critical point are not tractable by rigorous mathematical 
tools at present. Accordingly, we rely on computer simulations with local and 
non-local RCAs in the following considerations. 



P Var|p - 0.5 | 




has a sharp increase as p approaches p c . 



4 Simulation of RCA on Finite Lattices 



4.1 Phase Transition in Local RCA 



The simulation was run on a 2-dimensional torus with noisy majority rule with 
4 + 1 local neighbors. All the sites are updated simultaneously at a certain time 
t. In this section we use the statistical physics methodology of finite size scaling 
that has been developed for Ising systems and consecutively applied to various 
cellular automata, including Toom automata [26, 23, 27]. Here we summarize 
the major results and conclusions; for details, see [28]. We define the activation 
density across the torus as follows: 



X t 

Ntotal 



(1) 



where, X t denotes the number of active sites at time t, and N to t a i is the total 
number of sites in the torus. The average density over n iterations is defined as: 

n 

p=-J2x t . ( 2 ) 

We define the parameter of magnetization as the distance to the known fixed 
point i 

1 " 1 

( H ) = -^Z\ x t~ d- 

n i=l 2 



(3) 
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The system exhibits three basic behaviors depending on p. For p values below 
the critical probability p c , the system is of ferromagnetic type, see Fig. 2a (left 
panel). For p values greater than p c , the system is like the paramagnetic state 
in Ising model, Fig. 2c (right panel). When p approaches the critical probability, 
large clusters are formed in the lattice; see Fig. 2b (center panel). In order to 
obtain a quantitative clraraterization of these observations, we have conducted 
a detailed analysis of critical exponents using finite size scaling theory. 




Fig. 2. Illustration of the three regimes. They correspond to the following states in 
Ising models: (a) p = 0.11 < p c - ferromagnetic; (b) p = 0.13423 ss p c - critical; (c) 
p — 0.20 > p c - paramagnetic. 



4.2 Estimation of Critical Probability and Critical Exponents 



In addition to the magnetization, we introduce the susceptibility % and the cor- 
relation length £. In the infinite-size limit, the following power laws are expected 
to hold: 



m ' 


~ (P ~ Vcf for p 


> Pc 


( 4 ) 


X - 


1 \p-Pc\~ 7 for p 


Pc 


( 5 ) 




' \P-Pc for p 


— t Pc 


( 6 ) 



Finite size scaling theory tells us that the fourth order cumulants of the magne- 
tization are expected to intersect each other at the unique point which is inde- 
pendent of lattice size L. The intersection of the curves determines the critical 
probability. For example, the fourth order cumulants are defined as 



U(L,p) 



(m 4 ) 

(m 2 ) 2 



( 7 ) 



where ”( )” means to take the average over all runs. We calculate U(L,p ) for 
several probabilities and for lattice sizes 45, 64, 91, 128. The critical probabil- 
ity estimated from their intersection points is 0.13423 ± 0.00002. For each RCA 
configuration, simulations run for 10 s ~ 10 9 steps, until they produced a steady 
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Fig. 3. The exponent of the correlation length is estimated by using finite size scaling 
theory. The correlation coefficient is displayed in the parenthesis followed the estimated 
exponent. 



estimation of the fourth order cumulants. The critical exponent, which corre- 
sponds to the correlation length £, is estimated according to finite size scaling 
theory [26]: 

oc L" (8) 

To estimate numerically the derivatives in Eq. 8, we fit four adjacent points with 
cubic interpolation. This approach has shown numerically precise and stable 
results [27]. The graphs are depicted in Fig. 3. The exponent were estimated 
using fp\p c , | p c , dl ° 9 ^™ ^ \ Pc , and dlog \ Pc . In the calculations, 

the previously determined value p c = 0.13423 is used. The average value over 
all estimations has been used to obtain the final estimation of v = 1.0429. To 
estimate the remaining two exponents (3 and 7, we again apply finite size scaling 
theory. The results are given in Table 1. Finally, we evaluate the consistency of 
the estimation of the critical parameters. It is known that the identity function 
2/3 + 7 = 2^ holds for Ising systems. Let us denote the difference between the left 
hand side and right hand size of the above equation as I er ror = 2/3+y — 2v. Table 
1 includes calculations with Toom cellular automata TCA [27] and coupled map 
lattices CML [29]. 



Table 1. Comparison of RCA and several other models 





P 


7 


V 


Ierror 


RCA 


0.1308 1.8055 1.0429 0.02 


TCA 


0.12 


1.59 


0.85 


0.13 


Ising(2D) 0.125 


1.75 


1 


- 


CML 


0.115 


1.55 


0.89 


0.00 
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Fig. 4. Magnetization as the function of the noise level for various non-local topologies; 
circle - local; dot - 25% of sites have 1 nonlocal: 25%(1); square - 100%(1); diamond - 
100% (4). 



Table 2. Critical exponents for various connection patterns 



Pc P 7 v I error 

Local 0.1342 0.1308 1.8055 1.0429 0.02 

25%(1) 0.1702 0.2262 1.4687 0.9760 0.03 

100%(1) 0.2052 0.3943 1.0981 0.8780 0.13 

100%(4) 0.2247 0.4141 0.9800 0.8759 0.06 



5 Critical States in Non-local Random Cellular Automata 

The introduction of non-local interaction is strongly biologically motivated. The 
neurons have most of their connections in their direct neighborhood, but they 
have also axonal links with other neurons outside of their surround. Clearly, 
axonal conductance plays an important role in actual neuronal circuits. We ap- 
proximate long-range axons by replacing several local (direct neighbor) connec- 
tions with new neighbors randomly selected from the whole lattice. During this 
rewiring process, each site maintains the same number of connections (4). There- 
fore, each time a site get a non-local neighbor, we cut out (randomly) one of its 
local neighbors. In the extreme case, when we cut out all 4 local neighbors of 
every site, we get an architecture that represents a mean field model. 

Figure 4 shows the effect of non-local connections on the critical behavior. 
We observe critical p in models with non-local links. However, in accordance 
with the intuition, p c becomes larger as the proportion of remote connections 
increases. The curves in Fig. 4 belong to models: local (circle), 25% of sites 
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having one non-local link (dot), 100% of sites have one remote link (square), and 
all sites have all 4 connections remote (diamond). Table 2 includes the critical 
exponents obtained by the finite size scaling method for each model structure. 
It is interesting that the various exponents drastically deviate from the values 
represented by the Ising universality class. At the same time, the hyperscaling 
relationship remains valid to a significant degree. 



6 Discussion and Conclusions 

In this paper we have introduced the biologically motivated neuropercolation 
model. We have identified two critical parameters: the noise level, and the non- 
locality. Noise modeled the self-sustaining, randomized, steady state background 
brain activity is the source from which ordered states of macroscopic neural 
activity emerge, like the patterns of waves at the surfaces of deep bodies of water. 
Neural tissues, however, are not passive media, through which effects propagate 
like waves in water. The brain medium has an intimate relationship with the 
dynamics through a generally weak, subthreshold interaction of neurons. The 
synaptic interactions of neurons provide weak constraints on the participants, 
and the resulting covariance appears in the form of spatiotemporal pattern. In 
the framework of this research we use mathematical tools of percolation theory 
to explore the information processing mechanisms in brains as the randomized 
activity of myriads of neurons that provides the basis for self-organization and 
higher brain functions. The present works belong to the basic building block of 
our neurodynamic model based on K sets [30, 31]. This is the single KI layer of 
homogenous elements. A higher complexity level is represented by the KII model 
which has interacting excitatory and inhibitory populations. KII may exhibit 
nontrivial limit cycle and aperiodic oscillations. The interaction of several KII 
sets produces the Kill system with aperiodic spatio-temporal dynamics. Kill has 
the potential of spontaneous phase transitions propagating in space and time. 
Research in this direction is in progress. 
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Abstract. The Hybrid Cellular Automata (HCA) modelling framework 
can be an efficient approach to a number of biological problems, particu- 
larly those which involve the integration of multiple spatial and temporal 
scales. As such, HCA may become a key modelling tool in the develop- 
ment of the so-called integrative biology. In this paper, we first discuss 
HCA on a general level and then present results obtained when this 
approach was implemented in cancer research. 



1 Introduction 

Traditionally, mathematical modelling of biological problems has focused on the 
integration of the most crucial properties of the phenomenon under study into a 
model formulated in terms of continuum ordinary differential equations (ODEs) 
and/or partial differential equations (PDEs) [23]. However, these methods im- 
pose a significant restriction on the modelled system’s time-scales. 

Physiologically-structured models [11,6] are one of the many approaches pro- 
posed to cope with this problem. In this paper we will focus on the hybrid cellular 
automata (HCA) , a multiple scale individual-based framework for modelling bi- 
ological processes. 

The paper is structured as follows. In Section 2, biological complexity and 
complexity in cancer are introduced as limitations to the traditional modelling 
approach. In Section 3, we give a general introduction to the HCA framework 
and its uses in biology, focusing on its integrative capabilities. In Section 4, we 
focus on a particular application of the HCA framework, namely evaluation of 
the efficiency of CHOP chemotherapy on non-Hodgkin’s lymphoma. 



P.M.A. Sloot, B. Chopard, and A.G. Hoekstra (Eds.): ACRI 2004, LNCS 3305, pp. 444—453, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 
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2 Biological Complexity and Complexity in Cancer 

Biological complexity has been recognized as a limitation to the current math- 
ematical research approach, particularly in areas such as physiology, molecular 
biology and genetics [29]. Complexity in pathophysiology and therapeutics may 
be due in part to the diversity of the levels of our knowledge: gene, molecular, 
cellular, tissue, organ, body and population. All entities of a living organism 
interact through quantitative functional relations with time scales varying from 
nanoseconds to the organism’s lifespan. This observation has led to the develop- 
ment of so-called systemic or integrative biology [16] and to the exploration of 
new methodologies [21], which might be more appropriate for studying complex 
and heterogeneous diseases such as cancer. 

It is necessary to understand the many intricacies of cancer in order to design 
efficient treatment. Many approaches to anticancer treatment have had limited 
success. Certain biological properties of cancer render it even more problematic 
than other complex diseases. One fundamental obstacle to cancer therapy is ac- 
quired tumor “robustness”, i.e. a self-organizing system which builds resistance 
to treatment [17]. Another feature is the multitude of intricate pathways for 
signal transduction. Though intermediates of multiple signalling pathways have 
been identified, understanding their function has proved to be an extremely 
difficult task [18]. The increasing evidence of cross-talk between pathways via 
signal transactivation adds an additional degree of complexity which is difficult 
to incorporate into traditional modelling approaches. Only fully integrative de- 
scriptive methods, capable of dealing with multiple scales, may assess disease 
and afford reliable treatment prediction. In this context, HCA models possess 
such capabilities. 

3 HCA Modelling of Biological Processes 

Cellular automata (CA) models have been applied to many areas of biology 
(see [14,22,1,4] for an overview). In recent years, a modification to the classic 
definition of CA has been introduced, yielding the so-called HCA. 

The classic definition of CA involves only local rules for the evolution of the state 
of a given element: the transition rules that define the site dynamics depend 
only on the configuration of its spatial neighborhood. However, many biological 
processes depend upon non-local signalling cues or nutrients. Chemical cues and 
nutrients are usually diffusive substances, smaller than the typical cell. 

Nutrient spatial distribution and various signalling processes play a fundamen- 
tal role in tumor growth [12,24,5], as well as in developmental processes [20]. 
Therefore, in order to obtain an appropriate description of these processes in 
CA modelling, it is necessary to expand the original setup to incorporate these 
non-local effects. This expansion is the essence of the HCA framework, which 
constitutes a first step towards an integrative (multiple scale) approach to biol- 
ogy- 

Given that molecules such as chemical cues and nutrients are usually small 
when compared to the size of a cell, they can be described in terms of continuous 
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fields that evolve according to appropriate reaction-diffusion equations (RDEs). 
Signalling cues are secreted by the cell’s internal machinery in response to either 
internal or external stimuli and feed back at the population level, altering the 
observed macroscopic pattern. The macroscopic structure of the vascular system 
affects the nutrient supply to the cells. In turn, nutrient levels modulate internal 
cellular processes such as cell division [6]. The HCA concept has recently been 
expanded to take into account such intra-cellular processes [6,25]. 

The HCA approach has been adopted to study various aspects of tumor 
growth. The model proposed in [12] is formulated as a two-dimensional HCA 
(or, more precisely a lattice-gas model) and reproduces many of the features of 
avascular tumors in vitro, e.g. their layer structure. In [24], a hybrid CA model 
of tumor growth in the presence of native vasculature is proposed to analyze the 
role of host vascular density and tumor metabolism on tumor growth. It seems 
that unlike normal cells, which use aerobic metabolism, tumor cell metabolism 
is glycolytic. One by-product of glycolysis is increased acidity. Since tumor cells 
are more resistant to acidity than their normal counterparts, it appears that can- 
cer uses the glycolytic phenotype (which produces H + ions) in order to increase 
its invasiveness. Several results regarding the interplay between vessel density, 
increased acidity, and tumor progression were obtained in this study. One of 
the most significant conclusions is the existence of a sharp transition between 
states of initial tumor confinement and efficient invasiveness when H + produc- 
tion passes through a critical value. This phenomenon has been observed in the 
clinic [15]. Finally, studies proposed in [7,8] use HCA in order to obtain realistic 
models for blood flow dynamics. 

Recently, HCA has been applied to study the effect of blood flow heterogene- 
ity on tumor growth [5] . Oxygen reaches the tissues via the vascular system. Due 
to the highly complex nature of blood flow and its interaction with the structure 
of the vascular system, which is also affected by the metabolic needs of the sur- 
rounding tissue, blood flow appears to be highly heterogeneous. Consequently, 
the spatial distribution of blood-borne nutrients and drugs is also heterogeneous. 
This heterogeneity has significant implications on tumor growth [5] and therapy, 
as we will observe in the following sections. 

In the next section, we will discuss a particular example of the application of 
HCA in order to evaluate the efficiency of current clinical protocols for CHOP 
therapy of non-Hodgkin’s lymphoma. 



4 CHOP Therapy for Non-Hodgkin’s Lymphoma: 
Insights from an HCA Model 

In this section, we present an application of HCA for assessing chemotherapy 
treatment for non-Hodgkin’s lymphoma (NHL) [25]. 

NHL patients are currently treated with CHOP chemotherapy 
(Cyclophosphamide, Doxorubicin, Oncovin, Prednisone) in which Doxorubicin 
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and Cyclophosphamide are the more active drugs [19]. CHOP is usually ad- 
ministered over a total of 6 to 8 cycles separated by 21-day intervals [10]. The 
relationship between this dosing interval and the efficiency of NHL CHOP treat- 
ment has not been systematically analyzed. However, theory suggests that the 
success of cancer chemotherapy is primarily determined by the frequency of drug 
administration [2,3]. 

4.1 Methods 

A two-dimensional HCA-based mathematical model aimed at simulating the 
effect of Doxorubicin on NHL was developed. The model takes into account two 
key factors which influence the efficiency of drug delivery: 

— coupling of NHL growth to the vascular network [31], which affects the struc- 
ture of the blood vessels; 

— blood flow heterogeneity which results from this diverse construction. 

The domain corresponds to a 2 mm square tissue initially filled with NHL cells 
forming a random pattern, and composed of 110 vessels whose radii are subject 
to adaptation through vessel structural modification processes [5]. 



blood vessels 
(honeycomb) 

cancel cells 
(black dots) 

empty spaces 
(white) 



Fig. 1 . Representation of NHL cells and the honeycomb-like vascular network on the 
computational domain (see [5]). 

Left: a fully populated domain; Right: the domain following significant cell depletion. 





The blood flow in each vessel is assumed to be laminar steady Poiseuille flow. 
For the dynamics of nutrient and drugs, the adiabatic approximation is applied, 
according to which chemicals (nutrient and drug) can be considered instan- 
taneously in steady state. For drug pharmocokinetics (PK), i.e. decay of the 
blood-borne drug, a one-compartment model, in which the drug concentration 
in plasma over time declines exponentially, is considered. For drug pharmaco- 
dynamics (PD), i.e. effect of the extracellular concentration of drug on NHL 
cells, a simple logit relation is used for determining the probability for a cell to 
survive the drug. See the Appendix for the continuous model equations and [25] 
for further details. 
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The initial cell colony is composed of NHL cells divided into two categories: 
proliferative and quiescent cells. 

To model cell division, a simple cell-cycle model is considered in which the 
duration of each phase of the cell-cycle was set according to various NHL-kinetic 
studies [9,13,27]. Each cell is assigned an age, which increases at each iteration. 
Thus, the cells progress through the different stages of the cell-cycle. Normal 
progression through the cell-cycle may be disrupted by lack of nutrient, leading 
to quiescence or cell death, or cells may be killed by the drug. A significant 
attribute of this model is the ability of the cell colony to influence vascular mor- 
phology. Normal vasculature is known to be well organized and endowed with 
smooth muscle cells and pericytes. This allows the vessels to adapt their struc- 
ture to various mechanochemical stimuli [28]. Due to neovascularisation, cancer 
vessels are immature and therefore lack this structure. Consequently they are 
not able to undergo structural adaptation. Furthermore, cancer cells can desta- 
bilize established, mature vasculature, rendering it immature [31]. Therefore, 
in the model presented in [25] whenever a vessel is engulfed by cancer cells it 
is assumed that it loses its ability for adaptation and its radius is fixed at ran- 
dom. Vessels not surrounded by lymphoma cells retain the structural adaptation 
mechanism [5]. The status of all vessels (mature and immature) is updated at 
each time step. 

The following CA rules hold for each cell and at each time step of the model. 

— The probability of cell death is determined by the drug concentration and 
PD; 

— If the cell is not killed by the drug, it advances one time step in its cycle 
phase; 

— Between G1 and S-plrase, the cell can either die, be arrested, or continue 
progressing through the cell cycle according to its local environment, i.e, 
local concentration of nutrient and over-crowdedness. 

— If the environmental conditions are appropriate, the cell enters into G2 and 
divides, daughter cells moving towards higher nutrient concentrations. 



4.2 Results 

When the dynamics of a simulated NHL cell colony under CHOP chemotherapy 
were examined [25], a significant phenomenon was observed. After the initial 
effect of a drug application, the tumor begins to regrow at a steady and rapid 
rate. However, beyond a certain point, the cell colony’s growth ceases to be sta- 
ble and begins to display unpredictable oscillations of significant amplitude (see 
Figure 2). Blood flow heterogeneity appears to be a key factor in this result 
and its effect is illustrated in Figure 3, which compares cell recovery from a 
chemotherapy cycle when the vessel maturation/destabilization process is taken 
into account, compared to a case in which this assumption is relaxed [25]. Con- 
sequently, one of the conclusions from this study is that in order for treatment 
to be efficient, additional drug cycles must be administered before the tumor 
can enter the unstable stage of its regrowtlr. Note that we have considered a 
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Fig. 2. Model prediction of tumor growth with Doxorubicin chemotherapy treatment 
cycles separated by 21 days. 



regular hexagonal array of blood vessels which is unrealistic, in actual tumors 
the vasculature is very heterogeneous so we would expect the effects of blood 
flow heterogeneity to be even more pronounced. 

The HCA framework has also been used to describe tumor structures in 
two pathophysiological settings in a study of which the purpose was to predict 
the efficacy of two different conventional strategies for chemotherapy intensifica- 
tion [26]. Results suggest the existence of a critical drug regimen intensity (Cl) 
value, i.e. the ratio between the total dose administered and the duration of 
the treatment. If the regimen intensity is lower than Cl, the tumor succeeds in 
recovering to its initial size over the duration of the dosing interval. 



5 Conclusion 

HCA can be viewed as an effective means of dealing with some of the problems 
raised by biological complexity. Through its ability to integrate different tempo- 
ral and spatial scales, it constitutes a promising investigative tool for analyzing 
complex biological systems such as cancer and cancer therapy. In the example 
we presented, an HCA model has been used for investigating the efficacy of cur- 
rent and potential therapies of non-Hodgkin’s lymphoma. Within the context 
of certain model assumptions, our results have raised relevant and interesting 
conclusions on the issue of treatment efficacy. 
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Fig. 3. Model prediction on the effect of vessel maturation/destabilization process on 
cell population recovery following a 10 mg/m 2 Doxorubicin administration. 

Thin line: with vessel maturation/destabilization; Empty circles (thick line): no vessel 
maturation/destabilization is assumed. 
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A HCA Model Equations 

A.l Blood Flow 

Assuming Poiseuillc flow, the flow rate ( Q ) and resistance ( Z ) in each vessel are 

given respectively by: 



Q = 



AP 

"IT 



z = 



8/z(r, H)L 
7 rr 4 



(1) 

(2) 



where AP is the pressure drop between two points of the network, L, r, and H 
are respectively the resistance, length, radius, and hematocrit, /i is the radius 
and hematocrit dependent viscosity [5]. 



A. 2 Vessel Structural Modification 

We assume that the radius of each immature vessel (r* m ) is modified at each 
time step according to the equation: 

Dm — Tmat * (1 e) (3) 

where r m at is the initial radius of the mature vessels, and e is a random number 
uniformly distributed in the interval (0,3) according to [30]. 
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A. 3 Nutrient and Drug Diffusion 

Assuming adiabatic conditions, the diffusion equation for the concentration 
( C(x,y,t )) of nutrient or drugs is given by: 

A'V 2 C( x, y, t) - q( x, y) ■ C(x, y,t) = 0 (4) 

where K is a diffusion coefficient and q(x, y) the uptake coefficient at position 
(x,y). 

On the vessel walls, we impose the boundary conditions: 

-Kn v ,-VC(x,y,t) = P-(C b -C) (5) 

where n w is the unit vector, orthogonal to the vessel wall, C b is the drug or 
nutrient concentration in the blood, and P the permeability of the vessel. 

On the edges of the computational domain we impose no-flux boundary condi- 
tions: 

n\on ■ S7C(x,y,t) = 0 (6) 

where n\ga is the unit outward vector, orthogonal to the boundary of the domain. 



A. 4 Doxorubicin PK/PD 

The decline of drug concentration in plasma (Cb) is given by: 



dC b 

dt 



-k ■ C b (t ) 



( 7 ) 



with initial condition: 



0,(0) 



dose 



(8) 



where V,i is the volume of distribution of the drug, and k the fraction of drug 
which is eliminated from the compartment per unit time, inversely related to 
the half-life t]/ 2 : 



ln( 2) 
tl/2 



(9) 



The survival fraction SF (percentage of cells that survives the drug at each time 
step) is given by: 



a • C b (t) 
C b (t) + Eci/2 



(10) 



where C b (t) is the relevant drug concentration and a, Ec \/2 are constants. 
See [25] for model parameters and further details. 
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Abstract. The dynamics of immune response to initial infection and 
reinfection by the same pathogen sometime later, are considerably dif- 
ferent. Primary response, which follows initial infection, is characterised 
by relatively slow precursor cell activation and population growth rates, 
with a consequent elongated pathogen clearance profile, typically ex- 
tended over six days or more. On the other hand, secondary response (to 
reinfection by the same pathogen some time later) is notable for short 
effector activation time, high specificity of response, rapid pathogen elim- 
ination and high degree of memory cell participation. In this paper, we 
present a seven state non-deterministic finite automata (NFA) of the ef- 
fector T cell lifecycle, which is encoded as a set of states and state tran- 
sitions. Our objective is to study the degree to which variable infection 
outcome is dependent on the accumulation of chance events. Such chance 
events may be represented as the consequence of premature, delayed or 
even failed state transitions. We show how small variation in crucial state 
transitions probabilities during the lifecycle can induce widely variable 
infection outcomes. This model is implemented as a spatially extended, 
concurrent two-dimensional stochastic cellular automata, executing on a 
MPI-based Linux cluster. 



1 Introduction 

Cellular Automata (CA) have been applied to numerous areas of complex physi- 
cal systems modelling [1]. CA have several important characteristics which make 
them amenable to efficient computational implementation, including ease of rep- 
resenting (in the form of ?r-dimensional arrays), discrete nature of the underly- 
ing computations, simplicity of rules or laws which are programmed into the 
CA, and the highly repetitious nature of the processing steps. However, cellu- 
lar automata posses additional fascinating properties, for example, patterns of 
self-organisation of a complexity which cannot be derived numerically from the 
rules on which the underling cellular automata is based. As a result of this com- 
plexity, [2] has postulated that some form of CA must underlie many complex 
physical phenomena visible in nature. Furthermore, with the application of non- 
deterministic ( stochastic ) cellular automata, the idea of randomness in CA site 
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selection and update rule enforcement has yielded further insight into modelling 
stochastic natural systems such as molecular motion, turbulence in water flow 
and various biological processes, especially models of the human immune system 
[3, 4, 5, 6, 7]. In this paper we present an approach that seeks to avoid a computa- 
tional modelling process exclusively influenced by current experimental immune 
system research trends. We propose a relaxation of the deterministic assump- 
tions inherent in earlier work [8] , and explore the dynamics of a more stochastic 
system. Stochastic events appear to play a crucial role in certain immune system 
functions [9]. The contribution of this work is as follows: (i) an extended non- 
deterministic state transition model of the effector T cell lifecycle is introduced. 
This model successfully reproduces time-realistic effector and pathogen popu- 
lation dynamics during primary and secondary response, and during repeated 
reinfection, (ii) we identify three stages in the effector T cell lifecycle model 
which are critical in regulating the course of primary and secondary response, 
and (iii) the model is implemented as a spatially extended two-dimensional cellu- 
lar automata lattice executing concurrently on a MPI-based Linux cluster. This 
allows us to scale the model to cell density levels in the order of 10 6 CTL cells - 
which approaches levels typically observed in in-vivo murine experiments. This 
work is arranged as follows: section 2 is a brief overview of some key features 
of the adaptive immune response which we model, and serves as an introduc- 
tion to some specific terminology. Section 2 is intended for readers who may be 
unfamiliar with general principles of immunology. Section 3 discusses the model 
structure and explains the motivation and implementation of the underlying 
non-deterministic cellular automata. Section 4 presents results of the simula- 
tion, and in particular, some interesting features which emerge. Finally, section 
5 is a discussion of the results, and an outline of further enhancements. 



2 Adaptive Immune Response 

Common to all immune systems is the principal of sensing of localised space for 
the purposes of intrusion detection. Intrusion, in this case, is the appearance 
of a bacteria, viral particle or infected cell which may be classified as non-self. 
Any non-self genetic material discovered must be eliminated in order to prevent 
infection (or even death), of the host. Broadly speaking, the means by which 
the non-self intruder gained access to the blood stream or lymphatic compart- 
ments is not of interest 1 . There are a great variety in the pathogen challenge 
and immune response course (not all of which are a concern here). One such 
scenario arises as follows: when a viral particle has been taken up by an antigen- 
presenting cell (APC), such as a dendritic cell, it is degraded into one or more 
peptide chains within the cytosol region of the APC, and is then bound to the 
major histocompatible complex (MHC) class I molecule (a process known as 
antigen processing) before finally being presented on the surface of the APC as 

1 Some viruses, for example, the influenza and corona viruses, enter the host through 
the air passages and not through tissue damage. 
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an MHC:peptide complex, a process known as antigen presenting. APC will re- 
circulate through the lymphatic system in order to alert the immune system to 
an infection. Sensing of the lymphatic compartments (of which there are many) 
for antigen-presenting cells, is a systematic function of immune cell (lymphocyte) 
recirculation. Cytotoxic lymphocyte (CTL) precursor cells constantly recirculate 
and sample their environment in the search for foreign pathogens. The process 
of sampling involves two cells binding for some small time period, during which 
the immune cell senses the receptor of the bound cell to determine if the bound 
cell is an invading pathogen (or not). If the bound cell is an invading pathogen, 
the immune cell may be stimulated to produce clones of itself in order to attack 
and remove other cells bearing the same genetic material. Under normal cir- 
cumstances, the production of clones ceases after some fixed period of time, and 
once the infection has been cleared, most CTL cells will undergo programmed 
death (apoptosis). A small subset of the clone population will remain activated 
indefinitely, and this population represents effector memory. In the presented 
here, we do not model free antigen, but only antigen epitopes which have been 
bound to the surface of an antigen presenting cell. 

3 The Model 

Our model runs in discrete 30-minute timesteps, and all entities in the model act 
asynchronously at each timestep (t) . As primary response normally consists of 4 
days of cell replication (clonal expansion), the cells in our model will stop dividing 
at r = 192. The recirculation space of the lymphatic compartment is modelled 
as a two dimensional stochastic cellular automata lattice of length L = 10 3 , 
with periodic boundary conditions and neighbourhood radius r = 1 (in two- 
dimensions), with a maximum of 8 neighbours. Each site is selected at random 
for update during the timestep. Not every site is will be visited at each timestep, 
but each site can be updated at most once in any given timestep. At r = 0 some 
5000 antigen entities are introduced into randomly selected sites on the lattice 
(following a uniform distribution), and the model executes until r = 3000 (62.5 
days of elapsed time). The CTL population grows exponentially in response 
to APC stimulation, with a clonal expansion rate which is a function of the 
affinity between the CTL and APC. The dynamics of affinity are modelled using 
shape space [10,11]. The stimulation rate never exceeding 0.036, which yields a 
population of daughter clones of ~ 1000 after 4.5 days of clonal expansion. Each 
lattice site may contain only one entity at any given timestep. The set of entities 
and states supported is shown in Table 1, which also introduces some important 
notation used throughout this paper. 



3.1 Non-deterministic Finite Automata 

To allow the study of a distribution of possible outcomes, we identify a subset of 
the CTL lifecycle state transitions, and replace the certainty of a transition from 
state w to state v on event e with some probability (< 1) of state transition. 
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Let us start by defining what is meant by state transition relaxation: If X is a 
discrete random variable {drv) representing the transition from state w to state 
v, and e is some event linking wv, the relaxed state transition X r is: 

P(X r \e) = 0 < ij) < 1 (1) 

The choice of value for if) will naturally depend on the wv in question. In 
contrast to earlier models , Eq. (1) implies duality in the presence of event e: 
transition on e ( X r ) or not ( X r ). This extension results in a non- deterministic 
finite automaton (NFA) [12]. Fig. (1) is a non-deterministic finite automata 
model of the lifecycle of the CTL (and follows notation explained in Table 1). 
E is the set of events the model, and consists of both deterministic and non- 
deterministic elements. We define a subset of three critical non-deterministic 
events S C E as: S = {e 2 , 8 , £ 3 , es}. Each e* £ S is defined as follows: 

e 2 ,e 8 An infected antigen presenting cell will be destroyed by a bound cyto- 
toxic lymphocyte cell which recognises it. Recognition is a function of the 
distance between the two cells in shape space. 
e-i An activated proliferating immune cell (state ctl + *) will normally end 
clonal expansion on the event (e 3 : age(ctl + *) > 192). 
e6 The fraction of effector T cells entering the long-lived memory pool. Nor- 
mally the majority of activated effector cells undergo programmed cell 
death (apoptosis) at the end of primary response. However, recruitment 
to the memory pool consumes around 5 — 10% of activated CTL [13,14, 
15], thus, a further stochastic transition occurs on ee, with of ctl + i enter 
ctl + * on event (e§ : age(ctl + 1) > 192). 

e rp t Repeated reinfection events, resulting in repeated doses of infected antigen 
presenting cells introduced into the simulation, at timestep r + 300n, n = 
0 ,1,..., 9. 

Each of the above events (e„) has an associated probability ip n . The set 
{V’l, ^2) ^3> ^ 4 }) therefore fully describes each simulation configuration of the 
model (all other parameters being kept constant) . In the results presented in the 
following section, we define the following four experimental onfigurations of V: 

1. Pi : {0.9, 0.9, 0.9, 0.0} 

2. V 2 ■ {0.9,0.9,0.95,0.0} 

3. V 3 ■ {0.9, 0.9, 0.9,1} 

4. Va ■■ {0.9,0.9,0.95,1} 

The first two configurations of V test the fidelity of the model response when 
confronted with a singular secondary infection event some 30 days after the initial 
infection. The first configuration represents a normal response and is intended to 
calibrate the model for near optimal conditions. For Pi, we would expect to see 
CTL production levels broadly characterised by low, elongated peak for primary 
infection, followed by an increase in memory CTL. Another expected observation 

in that P(X r \e) = 1 
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Fig. 1 . A seven-state non-deterministic finite automata of the cytotoxic lymphocyte 
cell lifecycle. Transition events (e„), which carry the same label, are non-deterministic. 



is APC clearance: over some 6 — 10 days for primary response, and significantly 
faster during secondary response. The second configuration is an increase in ips 
from 0.9 to 0.95 and is intended to test the impact of a 5% decline in the number 
of cells which transition to the effector memory state (ctl +1< —> ctl + ®). Some viral 
infections are known to case damage or loss of the memory pool [16], and we 
test to see the impact this has on our model. We test repeated reinfection with 
normal and depleted memory cell production (V 3 and V 4 , respectively). Many 
pathogens are known to lead to acute and persistent viral infections, and we test 
the importance of memory cell production in these cases. Again we deplete the 
memory production by 5% and study the consequences of this loss. Section 4.1 
examines the results of persistent infection in our model. 

4 Results 

The model is initially executed with parameter set V\ and V 2 (with no repeat 
reinfection), and the results are shown in Fig. 2. In (a), the initial infection 
is visible at r = 0 with pathogen density pd = 5000, (the broken line) and 
consequent effector response reaching a maximum value at r = 300, with = 
8.2 x 10 3 . Fig. 2(b) shows the antigen presenting cell population level (only). 
No memory cells are present during primary response, and as such, the effector 
cell population is made up entirely of clones produced by stimulated precursor 
cells. To the right of each effector cell peak is a plateau of memory cells. The 
slope of the CTL density peak is extreme, indicating that the state transitions 
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Table 1 . Notation and definition of model entity states 



Notation 


Definition 


ctC 


naive recirculating effector precursor 


ctl + * 


proliferating lymphocyte 


ctl + ‘ 


dead activated lymphocyte (apoptosis) 


ctl + ® 


activated memory effector 


ctl +(B * 


activated proliferating memory effector 


ctt+ et 


activated memory effector 


ctl +f 


armed activated effector 


apc + 


active infected antigen presenting cell 


apc + * 


dead infected antigen presenting cell 



from ctl + * to ctl + 1 to ctl + ® (or ctl + *) occurring with a high degree of certainty. 
At time r = 1500 (day 31), secondary exposure to the same pathogen occurs, 
and the model exhibits following general behaviour: (i) the secondary immune 
response is preceeded by a pool of committed CTL memory cells which have 
already been primed to respond to the re-appearing pathogen, (ii) the activated 
CTL density is some 10 times higher than primary response, and does not last as 
long, and (iii) the pathogen is reduced to half its original level much more rapidly 
than during primary response. With V\, the model exhibits efficient detection 
and clearance behaviour associated with a healthy immune system. From Fig. 
2, it can be seen the advantage in both time and infected cell clearance which is 
conferred on a response based largely on memory: the half life of the virus during 
primary response is around 3.25 days, with 90% pathogen clearance achieved at 
around r = 480, or 10 days of simulation time. Compared to secondary response 
on reinfection we see an infected cell half life of r « 60 or 1.25 days - an efficiency 
of around 87%. Effectively, this is because memory cells, having already been 
primed by a previous encounter with the specific pathogen, undergo expansion 
with lower death rates than during primary response: they therefore accumulate 
more quickly [17]. The results for V 2 are shown in Fig. 2(c) and (d). Here, 
the probability of entering apoptosis is increased from 0.9 to 0.95. This means 
that the memory cell population would be around 5% of that activated effector 
population post-primary response. Recent work (notably [17]) has shown that 
some w 90% of activated effector undergo apoptosis after primary response. 
Therefore, ip 3 = 0-95 would represent an unusually high suppression of memory 
function. Clearly, the reduction of memory effector production should not effect 
primary response, and this is borne out by CTL density levels prior to r = 1500 
(c). We see a normal 10-day clearance regime (d) during primary response, but 
a less effective response during reinfection: in fact, the memory cell pool in the 
time range 500 < r < 1500 has fallen to ss 500. Once reinfection occurs, the APC 
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Fig. 2. CTL and pathogen lattice density levels (a),(c) over a simulated 62.5 day period, 
with an initial infection at time r = 0 and a reinfection by the same pathogen occurring 
at r = 1500, for 3 values of V ■ Antigen presenting cell (APC) density is shown by the 
broken line, with the solid line indicating levels of effector memory and activated cells 
combined. For clarity, (b),(d) show population levels for APC for each V ■ 



population is cleared some 31% more effectively than during primary response. 
The APC half life is r = 108, 90% clearance is achieved after reinfection at 
r ss 1788 (or some 5.9 days of simulated time). However, the characteristics of 
V -2 are significantly degraded compared to that observed in V\. 



4.1 Persistent Reinfection 

Some viral pathogens are capable of persistent reinfection, in that, although pop- 
ulation levels of infected antigen presenting cells may decline in response to clear- 
ance pressure by a specific CTL response, over time, the number of infected cells 
rises to chronic and sometimes acute levels. Examples of such viruses are HIV, 
HTLV, hepatitis C (HCV), hepatitis B virus, CMV EBV and rubella [16]. Such 
persistent reinfection pathogens have been associated with normal immune func- 
tion suppression. In this section, we simulate persistent reinfection by randomly 
scattering a repeat ‘dose’ of the pathogen, introduced at r + 300?r, n = 0, 1, ..., 9. 
This reinfection pattern is a represents a resurgence of infected cells every 6.25 
days, in discrete bursts. The results of this simulation are shown in Fig. 3. 

With respect to Fig. 3 (a), the response to the first reinfection is clearly 
strong: some 3.8 x 10 5 lymphocytes are generated and the reinfection is rapidly 
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Fig. 3. The model is exposed to repeated infection events, arising at time r = 300 n, n = 
0, 1, equivalent to an infection every 6 days. 



eliminated. As further infections arise starting at r = 600, the existing memory 
pool never falls below 1.8 x 10 5 , and is critical in bringing the repeated rein- 
fections under control in time periods (b) which rarely exceed 130 timesteps 
(or 2.8 days of simulated time). We also see from (a) that slightly lower re- 
sponses are sufficient in order to effect optimal clearance. Results from (a) and 
(b) support the clinical findings that the memory cell levels tends to be higher 
after secondary and tertiary infections [17], which in turn, supports the clinical 
practice of vaccination boosting. Finally, when the simulation is executed with 
diminished memory cell creation and repeatedly stressed with reinfection ( V ± ), 
average primary and secondary response levels are similar (around 1.2 x 10 4 ). 
Each response is characterised by rapid expansion and reduction of effector lym- 
phocyte clones. There are no memory cells to confer clearance advantage, and 
each response is initiated from low levels (around 1.2 x 10 2 ). 

5 Discussion and Conclusions 

The approach taken in this research was to construct a stochastic model of the 
effector T cell lifecycle, in order to study a distribution of possible simulation 
outcomes. We have shown how the model reproduces well the time and space 
dynamics of initial and secondary infection. In addition, we believe the research is 
valuable in modelling the relationship between repeated reinfection and effector 
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cell transition to memory or apoptosis. We have demonstrated how repeated 
reinfection can be controlled only within a limited range of too much memory 
causes the lymphatic compartment to fill-up, too little memory induces the need 
for clonal expansion from naive precursor cells, and a elongated APC clearance 
profile. When the ratio of apoptosis to memory is ‘just right’ (0.88 < ip 3 < 0.92), 
antigen presenting cell levels (during repeated reinfection) are brought under 
control in increasingly rapid time frames. The next steps in this research are 
to test the homeostasis of our model: where does the model break down, and 
what insight does this provide. Very recent clinical work [16] suggests that the 
immune system must periodically preferentially eliminate some memory cells 
which exhibit poor cross-reactivity. One of the benefits of the the stochastic 
effector T cell lifecycle model presented here is the relative ease with which this 
theory could be investigated. The benefits of selective memory cells reduction 
may form the basis of further work with this model. 
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Abstract. Evolution of a population consisting of individuals, each holding a 
unique “genetic code”, is modeled on the 2D cellular automata lattice. The “ge- 
netic code” represents three episodes of life: the “youth”, the “maturity” and the 
“old age”. Only the “mature” individuals can procreate. Durations of the life- 
episodes are variable and are modified due to evolution. We show that the “ge- 
netic codes” of individuals self-adapt to environmental conditions in such a way 
that the entire ensemble has the greatest chance to survive. For a stable envi- 
ronment, the “youth” and the “mature” periods extend extremely during evolu- 
tion, while the “old age” remains short and insignificant. The unstable environ- 
ment is modeled by periodic plagues, which attacks the colony. For strong 
plaques the “young” individuals vanishes while the length of the “old age” pe- 
riod extends. We concluded that while the “maturity” period decides about the 
reproductive power of the population, the idle life-episodes set up the control 
mechanisms allowing for self-adaptation of the population to hostile environ- 
ment. The “youth” accumulates reproductive resources while the “old age” ac- 
cumulates the space required for reproduction. 



1 Introduction 

The cellular automata paradigm is a perfect computational platform for modeling 
evolving population. It defines both the communication medium for the agents and 
the living space. Assuming the lack of individual features, which diversify the popu- 
lation, the modeled system adapts to the unstable environment, developing variety of 
spatially correlated patterns (see e.g. [1-3]). Formation of patterns of well-defined 
multi-resolutional structures can be viewed as the result of a complex exchange of 
information between individuals and the whole population. 

Another type of correlations - correlations in the feature space - emerges for the 
models of populations in which each individual holds a unique feature vector evolving 
along with the entire system [4], The aging is one of the most interesting puzzles of 
evolution, which can be investigated using this kind of models. 

It is widely known that the aging process is mainly determined by the genetic and 
environmental features. The most of computational models of aging involving genetic 
factor are based on the famous Penna paradigm [5,6]. This model uses the theory of 
accumulation, which says that destructive mutations - which consequences depend on 
the age of individual - can be inherited by the following generations and are accumu 
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lated in their genomes. The Penna model suffers from the following important limita- 
tions. 

1 . The location of the individuals in space is neglected, thus the system evolves in 
spatially uncorrelated environment with unbounded resources. 

2. Only two episodes of life are considered, i.e., the “youth” and the “maturity”. The 
durations of the two are the same for each individual. The “old age” is neglected. 

In this paper we propose a new model, complementary to the Penna paradigm. It does 
not consider genetic mutations. Instead, it allows for studying the influence of envi- 
ronmental factors on the aging process. 

The paper is constructed as follows. First, we describe our algorithm, its principal 
assumptions and implementation details. In the following section we discuss the re- 
sults of evolution and self-adaptation of population to the hostile environment repre- 
sented by periodic plaques. Finally, our findings are summarized. 



2 CA Model of Evolution 

Let us assume that an ensemble of S(t) individuals is spread on 2D NxN mesh of 
cellular automata (CA). The mesh is periodic. Each individual, residing in (i,j) node, 
is equipped with a binary chain - the “genetic code” - of length L. The length and the 
number of “l”s in the chain correspond to the maximal and actual life-time of indi- 
vidual, respectively. Only “l”s from “genetic codes” of each individual are read one 
by one along with the evolution while “0”s are skipped. Afterwards the last “1” has 
been read, the individual is deleted from the lattice. The individuals are treated as 
independent agents, which can move and reproduce according to recombination 
( cross-over ) operator from the genetic algorithms. The code chain consists of three 
sub-chains corresponding to three episodes of life: the “youth” y, the “maturity” m 
and the “old age” o. They do not represent biological age of individuals, but reflect 
their reproduction ability. Only the “mature” individuals from the Moore neighbor- 
hood [7] of an unoccupied node of CA lattice are able to reproduce. Every individual 
can move randomly on CA lattice if there is a free space in its closest neighborhood. 

Let A = {a lj } NxN is the array of possible locations of individuals on the 2D 
NxN lattice of the cellular automata. The value of a i| e 9t, 9v={0,l}, where “0” means 
that the node is “ unoccupied ” and “1” that it is “occupied” . An individual is defined 
by corresponding “genetic code” or e 9t L such that: 



if (V 


= 1) then (ij)-»‘ 


is occupied 


°L 

y.j 


[y„> "v °*i5 

r l 2 l-i 


nr -4 [nr ,m, y ,..., m, y ], o. -A [o. ,o.. , ..., o. ], 






a y ff k , nr\ o iy k e {0,1 }, L=n+m+l 




else 






(id) ‘is 


unoccupied ’ and Ci u -A 0 



In Fig.l we show the sequence of instructions describing the process of evolution. 
The binary vectors y ij5 m i( , o u represent the subsequent episodes of individual life: the 





466 W. Dzwinel 



“youth”, the “maturity” and the “old age”, respectively. The values of l,m,n are the 
maximum lengths of each of the episodes while their actual durations are equal to the 
number of “l”s in the corresponding vectors y ij5 m ij7 o u . The symbol Q denotes the 
classical recombination operator from the genetic algorithms, t is the number of gen- 
eration cycle (time), pla^) is the unitation operator, (i.e., it returns the number of 
“l”s in a u chain) and the function p k ( ) is the “counter” operator defined as follows: 

V (a y = 1 a pCoty) > k); p^Ojj) = k. 



while t<MAX do begin 


// Initialize the following generation g. 


for i = 1 to N do begin 


// Go through every lattice site. 


for j = 1 to N do begin 




jfjkjS = 0 then 


// If lattice site (ij) is unoccupied 
// find two different “mature” individuals 


m =Jind two mature neighbors (y. dj. a 2 ) 


// in the Moor neighborhood of a^ 


/ 


//If they do not exist m=0. otherwise m=l 


ifj^m=l) then 




§!£?ij 

(Pi, P 2 ) -> n (a ls a 2 ) 


// Reproduce them. 


-» (Pi, P:M, 


// pr- a probability for selection of 
// one out of two offspring (pi. p 2 ) 


)&—■ ► 1 




else 


//If a site (ij) is occupied 


if p te (a») = p(Oij) then 


//delete it if its life-time passed 


hi -> 0, as+'i, -> 0 




else 




fc -> Pwi(a>i); 


// or make it older 


end 




end 




end. 





Fig. 1 . The pseudo-code describing evolution rules. 



We assume that, the population can be attacked by a plaque represented by “seeds”. 
The “seeds”, which are generated periodically in time, are scattered randomly on the 
CA lattice. The strength of the plague is defined by e 0 - the ratio between the number 
of “seeds” and the total number of individuals. If a “seed” is located at the same place 
as the population member, both are removed from the lattice. Otherwise, the “seed” 
moves randomly on the CA lattice until it “annihilates” with the first encountered 
individual. The “seeds” cannot reproduce. 

Our system consisting of elements with “genetic codes” evolves not only on 
CA lattice but also in the abstract multi-dimensional feature space 9t L represented by 
the coordinates of binary chains a ir As shown, e.g., in [4], the clusters of similar 
individuals are created both on the mesh and in the feature space 9i' due to the ge- 
netic drift. These clusters can be extracted using clustering algorithms [8,9] and then 
visualized in 3-D space by employing multidimensional scaling (MDS) algorithms 
[8,9]. 
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3 Results of Modeling 

The parameters assumed for a typical run are shown in Tab.l. The periodic lattice of 
cellular automata 200x200 and 100x100 were considered as optimal ones balancing 
well adequate representation and computational requirements. These parameters are 
also sufficient to obtain stable populations and partly eliminate boundary effects. 



Table 1 . The parameters for typical simulation. 



Lattice size (NxN) 


100x100, 200x200 


Mobility factor e(0,l) 


1 


Initial density (P0) 


0.2 -0.5 


Probabability of reproduction 


1 


“Youth” - length 


32 


Probabability of mutation 


0 


“Mature” - length 


32 


Plague period 


50 


“Old age” - length 


32 


Dose ( £ n ) 


0.4 



At the start of evolution, the population is generated randomly with P0 density 
(P0e(0,l), see Tab.l). Because all individuals are initially “young”, the evolution 
scenario depends strongly on P0 (see Fig. 2). For both too large and too small P0 val- 
ues, after some time, the number of offspring can become marginal in contrast to 
massive extermination of “old” individuals from the initial population. This may lead 
to fast extinction of the whole population. This effect can be considerably reduced by 
increasing mobility factor of individuals, their life-time and initial diversity of popu- 
lation. 




Fig. 2. Various scenario of the growth of population size in time for increasing P0 (initial 
population size). The simulation was started assuming that all the individuals are “young”. The 
CA lattice of size 100x100 was simulated. 
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We have assumed additionally that: 

• the length L of the vector representing the “genetic code” is equal to 96, 

• the lengths of vectors y, m, o are identical, i.e., l=m=n = 32 (see Definition 1). 

The value of L was selected intentionally to have more compact representation (thus 
more efficient code) of individual, whose “genetic code” can be implemented then as 
three float values. The value of L cannot be too small due to statistical validity (the 
number of “T’s in various episodes of individual’s life has initially the Gaussian dis- 
tribution) and due to high sensitivity of the system on various simulation conditions. 
Other configurations and vector lengths were also examined. The first conclusion is 
that, the individuals, even those with the same life-time lengths L, can behave in vari- 
ous ways depending on the lengths of subsequent life-episodes y, m, o. On the one 
extreme, the population with too short “maturity” period will die quickly. On the 
other, the populations with greater reproduction potential (defined by the length of m 
vector) will tend to fill the m part of vector a with “l”s. This is due to the population 
members who are “mature” for a longer time, have a greater chance to reproduce and 
pass their “genetic code” to other generations. One can expect that the similar behav- 
ior will be observed for “idle” episodes of individual’s life i.e., the “youth” and the 
“old age”, i.e., the individual’s life-time will increase due to the evolution to the 
maximum length L. However, the situation is completely different. 




Fig. 3. The histograms representing the number of individuals with various lengths of y, m, o 
life-episodes. For initial generation of individuals the distributions are similar (the plot on the 
left), but after g=2000 steps they diversifies considerably (the plot on the right). 



Let us assume that initially the distribution of “l”s in each of the three episodes of life 
is Gaussian and there are in average 16 “T’s in each of y, m and o vectors. These 
initial conditions are shown in Fig. 3a. After t=2000 time-steps, the situation consid- 
erably changed. The distributions of “l”s for each period of life undergo strong diver- 
sification (Fig. 4b). 

As displayed in Fig.4, the distribution of individuals both on the CA lattice and in 
the feature space, changed also dramatically. Instead of initially chaotic configuration 
of individuals populating 2D lattice, they form distinct clusters. The individuals be- 
longing to the same cluster are similar according to the Hamming distance in the L-D 
feature space. As shown in Fig. 4b there exist four distinct “families” of individuals in 
the feature space. In Fig. 4a we show them projected onto the CA lattice. 



A Cellular Automata Model of Population Infected by Periodic Plague 469 




Fig. 4. Clusters of individuals on 2D lattice (a) and in the feature space (b). Single plate in 
Figs. 4b corresponds to a group of individuals with identical “genetic codes”. Fig.4b shows the 
result of k-niean clustering in the L-D feature space transformed by using multi-dimensional 
scaling to 3-D space. Various colors in b) indicate the spurious clusters obtained using k-means 
clustering scheme. While colored 2-D clusters on the CA lattice in Fig.4a represent the four 
clusters recognized visually from Fig.4b. 

The continuation of the evolution from Fig. 4 produces a stable attractor, which con- 
sists of four “families” of individuals, which have exactly the same “genetic codes”. 
The codes differ between clusters only on two bits positions. Therefore, the offspring 
generated due to recombination belong to one of the existing clusters. We did not 
obtain any global solution with only one large cluster of individuals having the same 
genetic code. It means that the fitness factor for the populations of individuals with the 
three life periods is not a trivial, increasing function of the length of life. This is unlike 
for populations, which are “mature” and ready for reproduction during the whole life- 
time ( L=m , l,n=0). In this case the attractor of the evolution process would consist of 
individuals with “genetic codes” filled exclusively by “l”s. 

The most basic features of attractors resulting from modeling are collected in 
Tab. 2. As shown in Tab. 2, where apart from the “natural” elimination - resulting from 
the limited life-time inscribed in the “genetic code” - there are not any other lethal 
factors, the “maturity” period fills with “l”s after relatively small number of evolu- 
tion cycles t. This is obvious because longer ability of reproduction gives a greater 
chance for passing the genetic code to the offspring. By extending the evolution time 
about threefold, also the “youth” vector will be filled with ‘l’s. Surprisingly, even 
much longer simulation does not affect the “old age” vector. It remains the mixture of 
“l”s and “0”s. This observation confirms also for: 

• variable lengths of y,m.o (l&n&i), 

• long “old age” period ( n=64 ), 

• much shorter remaining episodes (1=16, m= 24, respectively). 
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Table 2. Number of “l”s in the average “genetic chains” of different lengths for corresponding 
episodes of individual’s life after 50,000 time-steps. “ mix ” - the mixture of “0”s and “l”s, 
“ perished " - the population deceases quickly. 




Fig. 5. The histograms of individuals vs. their age for a) stable (1=32, m=32, n-32), and b) 
unstable (1=0, m=40, n=64) populations. 



Surprisingly, further decrease of the “youth” episode (1=8, m=32, n=64) with respec- 
tive extension of the “maturity” episode weakens considerably the population. For 
(1=0, m=40, n=64) it dies eventually. This behavior shows that the “youth” period 
accumulates reproductive ability of the population. If released too fast it will cause 
non-uniform aging (see Fig. 5b), which may result in fast extinction of the whole 
population. 
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As depicted in Fig. 6, the population attacked by the periodic plague dies if the 
strength (“ Dose ” in Tab.l) of the plague e 0 , defined as the ratio of the number of 
“seeds” to the number of individuals, and/or its period exceeds a certain threshold. 




Fig. 6. a) The influence of periodic plague (thin line) on the number of individuals (thick line), 
b) The number of individuals (thick line) eliminated by the periodic plague (thin line) in time. 

The attractors (i.e., the non-evolving populations of individuals with similar “genetic 
codes” obtained due to the long evolution) in a stable environment die very quickly 
due to the lack of adaptation ability represented by diversification in the “genetic 
codes” of individuals. For example, a uniform population (e.g., see Tab. 2 for 
l=m=n=32 ) obtained after long evolution (t=50,000 time-steps) and attacked then by 
the plague extincts during the following 100 steps. The same population, but this time 
infected at the early stage of evolution (after t=200 steps) survives. The “genetic 
codes” of individuals self-adapt to the unstable environment. As shown in Tab. 2, the 
“genetic codes” of attractors of attacked population are different than those obtained 
for the stable environment. Moreover, they differentiate depending on the period of 
the plague. 

For the outbreak with a period shorter than the average life-time of individuals, the 
“youth” episode, as the obstacle for fast reproduction, is eliminated completely (all 
“0”s in vector y). Surprisingly, the “old age” period remains relatively long. Because 
the population can have not enough time for reproduction between subsequent 
plaques, it has to elaborate sophisticated control mechanism of growth. Let us assume 
that: 

1. the “old age” is inhibited (n=0) and the population consists of only “mature” 
individuals, 

2. the majority of individuals are eliminated by the plaque from the lattice in a very 
short time. 

At the very moment when the plaque ceases, all survivors will produce many new- 
borns due to plenty of free space on the lattice. Therefore, after some time, the indi- 
viduals of a similar age and approximately the same life-time will dominate in the 
population. Their simultaneous death will weaken the population (see Fig. 5b). Thus, 
the number of “mature” individuals, which survive after the following disasters, may 
be too small to initiate new generations and the population may extinct eventually. 

Assuming that the “old age” episode is greater than 0 ( n>0 ), post-plaque 
demographic eruption (resulting in demographic catastrophe after some time) can be 
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much smaller than for n=0. It is easy to remark that the demographic eruption will be 
monotonically decreasing function of n because only the “mature” survivors have 
reproductive ability. Moreover, the existence of “old” individuals will decrease the 
probability of reproduction. The replacement of “old” individuals with newborns will 
be also postponed and possible only after their death. All of these demographic in- 
hibitors cause that the post-plaque reconstruction of the population takes more time 
than in the previous ( n=0 ) case. Instead, the age distribution in the population is more 
stable (see Fig. 5a). Thus the population allowing the “old age” episode is stronger and 
has a greater chance to survive in unstable environment than that consisting of only 
“mature” individuals. We can conclude that the “old” individuals accumulate the envi- 
ronmental resources (free space) for stable growth eliminating dangerous post-plaque 
effects such as demographic eruptions. 

When the plague period is greater than the average life-time of individuals 
and simultaneously the “strength” of the plague increases, the “old age” is also elimi- 
nated due to evolution. This is because the population has enough time for reproduc- 
tion and demographic minimum does not coincide with the plaque. 



4 Concluding Remarks 

We have discussed the influence of the lengths of three life-episodes: the “youth”, the 
“maturity” and the “old age” on population evolution. Of course, their duration of 
depends on the biological construction of individuals. The organism requires a mini- 
mum time to grow-up and be ready for reproduction. However, the terms “youth”, 
“maturity” and “old age” used in this paper have not only biological meaning. Envi- 
ronmental factors influence both reproduction ability and the life-time. They may 
cause that the same organism can be treated as “young”, “mature” or “old” independ- 
ently on his age. 

Summarizing our findings, we can conclude that “maturity” period decides 
about the reproductive power of the population and its survival ability. Thus the 
population increases its length to a maximum value allowed. The idle episodes of life, 

i.e., the “youth” and the “old age” play the role of accumulators of the population 
resources and control their growth. The “youth” accumulates reproductive resources 
while the “old age” accumulates the space required for reproduction. The idle life- 
episodes develop the control mechanisms, which allow for self-adaptation of the 
population to unstable environment. 

1 . In the case of a stable growth the reproductive resources are accumulated in the 
“youth” episode of life. The “old age” remains the secondary control mechanism. 

2. For periodically infected populations with the period longer than the average 
length of the life-time L the population is biased only for reproduction, eliminat- 
ing idle episodes of life. 

3. For strong enough and frequent pests the „old age” remains non-zero accumulat- 
ing additional space required for burst-out of population just after the plague 
vanishes. 
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Many aspects of the model have not been explored yet. For example, the influence of 
lethal mutations and other hostile environmental factors on the survival ability of the 
population. However, our model can be an interesting complementary constituent to 
the Penna paradigm of aging. 
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Abstract. This paper introduces a Cellular Automata (CA) approach 
to spatiotemporal data mining (STDM). The recently increasing interest 
in using Genetic Algorithms and other evolutionary techniques to iden- 
tify CA model parameters has been mainly focused on performing artifi- 
cial computational tasks such as density classification. This work investi- 
gates the potential to extend this research to spatial and spatiotemporal 
data mining tasks and presents some preliminary experimental results. 
The purpose is twofold: to motivate and explore an evolutionary CA 
approach to STDM, and to highlight the suitability of evolutionary CA 
models to problems that are ostensibly more difficult than, for example, 
density classification. The problem of predicting wading-bird nest site 
locations in ecological data is used throughout to illustrate the concepts, 
and provides the framework for experimental analysis. 



1 Introduction 

Space-time dynamics are ubiquitous in both real-world and artificial systems. 
The recently emerging field of spatiotemporal data mining (STDM) opens up a 
number of possibilities for the integration of previously distinct research areas. 
Cellular Automata (CA) techniques, in particular the use of Genetic Algorithms 
and other evolutionary approaches to learn the transition rules from given pat- 
terns, show great potential to be combined with data mining techniques in order 
to tackle problems of previously prohibitive complexity. They can also give in- 
sight to improve on previous approaches. 

In order to illustrate our approach, we use a classic problem in spatial data 
mining - the prediction of nest site locations in a wading bird ecology - and 
examine the merits and difficulties of applying an evolutionary CA approach. 
The data consists of nest site locations and environment variables (e.g. vegetation 
type, water depth). Given a set of spatial patterns (the training data), the aim is 
to construct a CA neighbourhood and transition rule such that they will predict 
the locations (or significantly aid prediction) of nest sites on unseen data sets 
consisting of the environment variables only. 

Genetic Algorithms have been used to discover CA rules that will perform 
computational tasks such as density classification (complete cellular grid to be- 
come one of two states depending on which had the higher initial density) [1], 
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synchronisation (all cells to flash on and off in synchrony) and most recently pe- 
riod three and quasiperiod three behaviours [2] . They have also been applied to 
identifying the neighbourhood and transition rules that will recreate spatiotem- 
poral patterns generated by CA [3] . The closest in nature to our work is that of 
Richards et al. [4] who also look at extracting cellular automata rules directly 
from experimental data. In their case the data relate to the dendritic solidifica- 
tion of NH^Br. The key theme here is to expand this concept to more complex 
multivariate scenarios. 

Section 2 introduces spatiotemporal data mining through a formal treatment 
of the location prediction problem. In section 3 we motivate and present our 
evolutionary CA model. Section 4 details the experimental setup and contains 
preliminary results and analysis. Finally, section 5 concludes and discusses future 
work. 



2 Spatiotemporal Data Mining 

Spatiotemporal data mining is a relatively recent expansion of data mining tech- 
niques to take into consideration the dynamics of spatially extended systems for 
which large amounts of data exist. Given that all real world spatial data exists 
in some temporal context, and knowledge of this context is often essential in in- 
terpreting it, spatial data mining is inherently STDM to some degree. Although 
time series and spatial data mining have existed as research fields for a num- 
ber of years independently, they have been growing towards a synthesis that is 
inherently multi-disciplinary. 



2.1 The Location Prediction Problem 

This multi-disciplinary nature is borne out by the fact that a well known and 
thoroughly researched spatial data mining application is that of predicting nest 
site locations in an ecological data set. A particular data set was first considered 
by Ozesmi [5] in an ecological modelling context, and then taken up as a spatial 
data mining problem by Shekhar et. al. [6]. 

Following [6], the problem is defined as follows: Given 

— A spatial framework S of sites {si, ..., s„} for an underlying geographic space 

— A collection X of explanatory functions fx k '■ S —>■ TR k ,k = 1 IR fc is 

the range of possible values for the explanatory functions 

— A dependent class variable fc ■ S — > C = {ci, ..., Cm}, c m £ {0, 1} 

Find: Classification Model fc ■ 1R 1 X, ..., xIR fe — > C 

In [6], a number of approaches are compared both theoretically and exper- 
imentally. In order to gain insight regarding spatially dependent multivariate 
processes, and to motivate the use of an alternative technique, these approaches 
are also briefly reviewed here in order of increasing sophistication. 
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Linear Regression. The classical linear regression equation does not attempt 
to model spatial dependence. It is defined as 

y = X(3+e (1) 

where (3 = (/3q, (3 m ) T , y is an ?z-vector of observations and X is an n x m 

matrix of explanatory data. This is then transformed from a real- valued variable 
to binary via the logistic function, Pr(ci\y) = e v /(l + e v ). When the samples are 
spatially related, the residual errors reveal a systematic variation over space. 



Spatial Autoregressive Models. In order to model spatial dependence, a 
spatial contiguity matrix is used. The essential idea is that spatial locations which 
are neighbours are coupled through an entry in the matrix. The simplest form is a 
binary matrix where entries are either 1 or 0 depending on a Euclidean adjacency 
metric, however ‘neighbours’ can be defined in any fashion and non-zero elements 
are often scaled to sum to unity in each row. Figure 1 shows an example spatial 
framework and figure 2 a corresponding row-normalised contiguity matrix where 
cells that share an edge have double the weighting of cells that are diagonal 
neighbours. The modified regression equation is 

y = pWy + X[3 + e. (2) 

The dependence on neighbouring classes is exerted in a static fashion through 
the contiguity matrix. The fact that it is at heart a logistic regression model 
means there is an assumption that the class conditional distribution belongs to 
the exponential family. 



Markov Random Fields. A more sophisticated approach is to model the 
data as a Markov Random Field (MRF). A MRF-based Bayesian classifier is a 
non-parametric model which, unlike logistic regression, is not bound to make 
assumptions that the class conditional distribution belongs to a particular fam- 
ily. A MRF explicitly models the relative frequencies in the class prior term. 
Conceptually it is therefore more suited to the real-world nonlinearity of this 
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work 



Fig. 2. Row-normalised contiguity 
matrix for the spatial framework 
shown in figure 1 
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problem and accordingly has the best experimental performance in the study in 

[ 6 ], 

MRFs consist of a set of random variables with an interdependency rela- 
tionship represented by an undirected graph which is analogous to the spatial 
contiguity matrix. The Markov property contends that a variable depends only 
on its neighbours and is independent of all other variables. For this problem, 
that means random variable U is independent of lj if W(si, Sj) = 0, where W is 
the neighbourhood relationship contiguity matrix. 

Using Bayes rule it is possible to predict /, from feature value vector X and 
neighbourhood class label vector Lj as follows: 



Pr(k\X,Li) 



Pr{X\li,Li)Pr{U\Li) 
Pr{X\Li ) 



(3) 



Where Lj is the label vector for the neighbourhood of i. 



2.2 The Curse of Dimensionality 

Despite its theoretical ability to encode complex probabilistic relationships in- 
volving multiple variables without reliance on any assumptions about the dis- 
tributions of class variables, the actual solution procedures for MRFs require 
large amounts of training data and/or a number of limiting assumptions on 
Pr(X\li, Li). This is because the graph of a Markov field must connect all pairs 
of variables that are conditionally dependent even for a single choice of values 
of the other variables [7]. In other words, it is hard to encode interactions that 
occur only in a certain context and are absent in all others. An example would 
be that the probability of a nest site occurring in a location with a certain com- 
bination of environment variables is very different when there happens to be 
three or more birds within a certain distance of that location. In many situa- 
tions these assumptions may be acceptable, however if there is a high degree of 
attribute interaction this is not necessarily the case. Attribute interaction is a 
crucial theme in data mining [8] and is common in many complex real-world sys- 
tems. The ‘curse of dimensionality’ is the situation where there is a high degree 
of attribute interaction over a sparse amount of data. 



3 Evolutionary CA for Data Mining 

The importance of approaching analysis of complicated systems with the goal 
of understanding rather than prediction is becoming more widely acknowledged. 
This places the emphasis on dynamic process rather static pattern, as noted by 
[9]. A model that has such a characteristic is Cellular Automata. CA are well 
suited to a high degree of attribute interaction and sparse data because they are 
not a statistical model of the system which requires a large amount of data to 
give ‘support’ to the probability distributions. Instead they are a model that can 
map every state of the environment to a state of the dependent variable. The 
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penalty for such flexibility however is that it is required to search a potentially 
massive parameter space for the correct rules. 

Evolutionary methods are highly suited to such a problem and have been 
used to evolve CA transition rules for a number of computational tasks. The key 
theme of evolutionary approaches is that the behaviour of the system is defined 
implicitly by a fitness function measuring the difference between the candidate 
solution and the desired solution. No explicit knowledge is needed to show the 
system how to perform the task. 

The line between a forward CA approach - conscious programming of CA 
rules with a resultant behaviour in mind - and a backward or inverse approach 
where no a priori knowledge is used and the desired behaviour is the only guide, 
is inherently indistinct. The nature of the relationship mirrors closely the mining- 
modelling one mentioned previously. In the CA literature this type of problem 
seems to have been predominantly approached in a forward sense. Examples of 
this abound especially in urban modelling [10,9] and, perhaps closer in form to 
our problem, in vegetation dynamics [11]. 

The problem is approached here in an inverse sense because from a data 
mining perspective this means there is no explicit requirement for prior domain 
knowledge. It also ensures the possibility to discover rules that are counter- 
intuitive and/or too complex to program consciously. Thus it is both more flex- 
ible and more general. In the evolution of a complex, higlr-dimensional system, 
it is often the case that there are only a relatively small number of degrees of 
freedom which contribute (above a certain threshold, which is indistinguishable 
from noise) to the dynamics, such that the attractor of such a system can be 
reconstructed in a greatly reduced state space. We see these two characteristics 
as analogous which motivates us to look for a parsimonious rule-set that will 
represent the important factors in nesting behaviour. Having said that, the im- 
mense size of state space is still a huge obstacle, and the method proposed here is 
intended only as a complementary approach: in combination with domain knowl- 
edge and other data mining techniques it offers the chance to discover nonlinear 
spatial and spatiotemporal relationships, even in the face of sparse data. 

3.1 Cellular Automata Model Definition 

The model we propose is a synchronous two-dimensional Cellular Automata 
where data belonging to each grid-cell is the basis for the state of a cell in 
the cellular lattice. More precisely, each cell’s state can be split into two parts. 
The first is the three independent variables: distance to open water, vegetation 
durability, and water depth; these remain constant for the duration of each 
simulation run. The second state denotes the presence or absence of a nest site - 
this state will be more typical of a CA in that it will be recalculated at each time 
step according to the current rules. The first state can be collectively thought of 
as the landscape in which the simulations take place. The second can be thought 
of as the progressive spatial distribution of the birds as they are drawn towards 
the attractor which characterises their nest site behaviour. 

Our CA can initially be defined as: CA = (17, Q , A, /, /) 
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Where: 

— Q = {(i,j)|l < i < L x , 1 < j < L y } is the L x x L y lattice of cell sites 

— Q is a finite set of cell state values 

— N is a neighbourhood template 

— I : f2 — > Q is the initialisation function 

— f:Q x Q\ N \ — > Q is the transition function 

Incorporating the two-part cell state values we have Q = {Qi,Q n }, where: 
Qi = {DOW, V, WD }, Q n = {0, 1} (4) 

Where Qi is the state of the landscape consisting of DOW - distance to open 
water, V - vegetation durability, and WD - water depth. Q n is the presence or 
absence of a nest site. 

The transition function becomes: 

f ■ Ql X Qn X Q\ NA X Qjf" 1 ->• Qn (5) 

Where N n is the neighbourhood for nest sites and JVj is the landscape neigh- 
bourhood. We may additionally wish to have different sized neighbourhood for 
each individual landscape variable depending on domain knowledge that certain 
variables have a larger influence. 



3.2 Genetic Algorithm Model Definition 



Similar to a number of evolutionary CA papers we use a standard CA-GA model 
of GA = {P, T, p, G, E}, where P is population size of candidate rules, T is the 
number of CA time steps, p is the mutation rate, G is the number of generations 
and E is the number of elite rules. However, unlike the artificial computational 
tasks tackled by other evolutionary CA methods which aim to converge to a 
particular target configuration, we are looking to learn a complex probability 
distribution that represents the nest location ‘behaviour’ of the wading birds. In 
order to realistically rate the ability of a CA rule to do this, it is necessary to 
run the GA on a number of training configurations (of target nest site locations) 
which follow some predetermined probability distribution. Also, the number of 
CA time steps before the fitness is measure is nonstandard, and is discussed in 
the next section. 

The fitness function of a rule is perhaps the most crucial ingredient in a suc- 
cessful GA. Our fitness function takes into account spatial accuracy so that con- 
figurations which are spatially near the target configuration are still rewarded. 
This may be crucial given the many potential sources of noise and uncertainty 
in spatiotemporal data sets, and also highly appropriate for discovering qualita- 
tive relationships. A modified version of the Spatial Accuracy Measure (SAM) 
outlined in [6] is used. 



SAM = TPR — FPR = 



AnMPn 



AnnMP 

( 6 ) 



AnM Pn + AnM Pnn AnnM Pn + AnnM Pnn 
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Fig. 3. Distance to Open Water, DOW 



Fig. 4. Water Depth, WD 



where TPR is the true positive rate, FPR is the false positive rate, An[i] = 
fc[si\ and Pn[i] = fc[si] are boolean vectors representing actual nest loca- 
tions and predicted nest site locations respectively, and Ann[i\ = 1 — An[i] and 
Pnn[i] = 1 — Pn[i] are their inverses. The spatial weighting is applied through 
M = W + 1, the matrix addition of contiguity matrix W and identity matrix /. 

4 Experimental Setup, Preliminary Results, and Analysis 

In [12], high fidelity colour images of the three spatial data sets from the study 
in question are provided, along with a colour-intensity scale. Due to a num- 
ber of factors, including: a desire to operate on the same data set for future 
comparisons, the qualitative nature of our approach at this initial stage, and an 
unavailability of other suitable data, we have used these images (rather than raw 
data) for our experiments. The relative values of the variables were preserved 
by creating a colour map based on the provided intensity levels, and the spatial 
integrity of the data was preserved by reducing the number of cells (pixels) in 
the image down to the corresponding number of data grid points in the original 
study. 

In the Spatiotemporal Data Mining section it was shown that both regression 
and MRF models may be forced to make limiting assumptions, especially when 
there is a high degree of attribute interaction. Our first goal was to show that 
an evolutionary CA approach is highly suited to problems that are nonlinear in 
this way. 

In order to investigate this in a experimental fashion, synthetic nest site 
locations were generated using a nonlinear generalisations of equation (2): 

Usyni — (I — pW)~ x x (/3 x cos(X) + c x random(e)) (7) 

Usyn 2 — (I ~ pW)- 1 x (/3 x cos(X) + W x cos(X) x 7 + c x random(e)) (8) 

where W is an equally weighted, row-normalised 7x7 neighbourhood, X is 
the DOW explanatory variable, p is an arbitrary spatial weighting of 0.6, c is a 
relatively small noise weighting of 0.5, e = IV (0, 1), and /3, the weighting on local 
explanatory data, has been set arbitrarily at 0.3. In equation (9), 7 represents 
the dependence of the dependent variable on neighbouring explanatory data; 
this has been set high (relative to 0) at 0.25 to generate a significantly more 
nonlinear problem. 
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Time Step 

Fig. 5. Gradual fitness increase over CA time steps 



Rule Types and Transients. The CA model defined above is very general 
and is likely in most situations to have a prohibitively large state space. In order 
to reduce this state space a number of different simplifications have been used. 
Firstly the the explanatory data has been quantised so that Qi = {0, 1, 2, ..., 16}. 
The level of quantisation should be partly determined by the specific domain. 
More importantly, a number of rule types have been considered. Initially it was 
thought that an outer-totalistic rule would be sufficient to capture the extent of 
the nest-site location interactions, however in looking for a rule that produced an 
increase in fitness over a number of CA time steps (ie one that reached a higher 
‘equilibrium’ fitness) it was discovered that symmetric rules tend to oscillate 
between fitness levels in a more extreme fashion. More complex rule types, while 
when averaged over a large number of specific rules may not have performed any 
better (plenty decreased in fitness over the CA time steps), seemed to produce 
a larger number of rules that purely increased in fitness. The mechanism behind 
this phenomena is more complex rules’ greater propensity for information trans- 
fer across the spatial lattice. In order to retain this property, while increasing 
the state space as little as possible, reflection symmetric (rather than rotation 
symmetric or completely symmetric) rules were preferred. Figure 5 shows the 
increase in fitness over a number of time steps for a von Neumann reflection 
symmetric rule on a single arbitrary target nest location data set. An empirical 
study of the transient behaviour of a large number of CA rules led to the re- 
quirement that at least Te CA time steps elapse before the spatial accuracy is 
measured, at which point it is averaged over at least the next Tav time steps, 
and T = {Te,Tav}. 



Nonlinear Generalisation Capabilities. Thus far a reduced complexity ver- 
sion of the model outlined above has been implemented where the landscape 
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Generation 



Fig. 6. Fitness of GA for two scenarios: one nonlinear in the local explanatory data 
only, the other additionally nonlinear in neighbouring explanatory data 



consists only of the distance to open water variable. Although we have not yet 
performed enough experiments for statistical significance, so far results are en- 
couraging. Figure 6 is a typical graph of fitness vs. generation where for both 
experiments: GA = {P,Te,Tav, p, G, E} = {100,30,5,0.01,100,20}. Each can- 
didate CA rule is matched against 10 synthetically generated target nest site 
locations according to equation (8) for the first experiment and (9) for the sec- 
ond. 

Both populations are performing at similar reasonable fitness levels, espe- 
cially given the difficulty of the task. Secondly and more interestingly, in the 
population with the more nonlinear probability distribution to learn, the fit- 
ness increases take longer to come, but are larger when they do. Intuitively this 
matches our perceptions about the nature of the tasks and the GA’s ability to 
learn nonlinear relationships for a CA model. 

5 Conclusion and Future Work 

We have used an analysis of statistical spatial models to give insight into the 
nature of attribute interaction and as the basis for experimental metrics. We 
have explored the potential for an evolutionary CA approach to STDM and 
presented some encouraging, though preliminary, experimental results. 

Of the two goals set out at the beginning, more emphasis has been placed 
on the suitability of CA to nonlinear problems. Hopefully this has provided 
an experimentally justified foundation for the application of CA to a broader 
range of problems. The suitability of this technique for data mining specifically 
has been given less attention due to space constraints and future work will be 
needed to develop the interpretation of CA rules for data mining purposes in 
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greater depth. Also, the issue of sensitivity to noise needs to be investigated, 
possibly through the application of a probabilistic CA model. 
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Abstract. Large areas of savannas are found in Africa in climatic zones 
favourable to humid tropical forests: they are relicts of past dry periods 
and forest domains are naturally expanding. Men have influenced the 
transgression, especially by starting savanna fires. FORSAT is a stochas- 
tic cellular-automaton model dedicated to the forest-savanna mosaic on a 
landscape scale, taking into account savanna fires, vegetation succession 
cycle (vegetation types are discriminated by their ecological function: 
herbs, pioneer seedlings, pioneer adults, forest) and pionneer seed dis- 
persal and recruitment. The model is validated by comparison between 
its emergent behavior and biogeographical field studies. Two parame- 
ters (an environmental factor and a man-related fire frequency) control 
an van-der-Waals-like phase transition between forest and savanna. This 
brings to three types of evolution: progression or regression of forest edge, 
formation and coalescence of clumps in savanna and global afforestation 
of savanna. These results explain the mainlines of ecosystem distribution 
in tropical Africa and the gap between the transgression rates inferred 
from paleostudies and currently measured. 



1 Introduction 

We know now that current climatic conditions make it possible for the tropical 
forest to expand over the periforest humid savannas, especially in Africa [1, 
2,3,4]. In most regions, this trend is balanced by savanna fires, that are the 
most significant man-induced disturbance (although some are natural) [5]. The 
balance between these two processes, one favorable - the forest succession - and 
the other unfavorable - the fires - results in a pattern of evolution in a particular 
zone. 

The evolution displays some similarities throughout the world. First, savanna 
fires are common in the whole tropics [5] . Secondly, the forest succession happens 
as follow [6,7]. Light-demanding forest pioneer species disperse seeds and settle 
in savanna (at the forest edge or further). Provided they do not burn, they 
grow, shadow the soil and attract forest seed-dispersers. Other forest trees settle 
beyond and so the edge progresses or a grove appears. 

However, if the processes that come into play are identical, there are regional 
particularities in their parameters and details. The pioneer species that first set- 
tle in open-land vary and each of them has its own characteristics (ability for 
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their seeds to travel far, ability to resist fire...). Then, in zones of homogeneous 
vegetation, there can be great changes in how often savannas experience fires (sa- 
vannas near villages usually burn every year, the other more or less often). This 
results in differences especially in the rate of edge progression, in the number and 
shape of clusters of forest trees in savanna. The few existing studies (e.g. [2,4]) 
fail to reconstruct the relation between the ecological and ethnological character- 
istics and the observed emergent behavior. That can be done with a mechanistic 
model, incorporating the key processes and taking the differences into account. 
It allows to check the effects of the virtual tuning of the parameters, their effects 
on the emergent behavior and then check using field studies. Moreover, such a 
model allows to generalize the knowledge acquired in some spots to the whole 
ecologically homogeneous region, where differences can appear, for example, in 
the man-induced disturbance. 

2 The FORSAT Model 

2.1 Modelling the Key Processes 

The FORSAT model [8] is designed in the cellular automata framework: the 
space unit is a 5-meter-sided square cell which can take four states of vege- 
tation : ‘ Herbs ' , ‘ Young Pioneers', 1 Adult Pioneers' and l Forest'. These states 
correspond to vegetation stages different in their constitution and their fire sen- 
sitivity. ‘ Herbs’ represents grasses and herbs (either savanna grasslands with 
a sparse shrubby vegetation or low understory vegetation in forest clearings). 

‘ Young pioneers ’ represents seedlings or bushes of forest woody pioneer species: 
it is the first stage in the colonization by the forest. ‘ Adult pioneers’ is dominated 
by seed producing pioneer trees with crowns above understory species. ‘Forest’ 
corresponds to patches of dense forest where the herbaceous cover is low and 
the structure, if not the species cohort, has reached an equilibrium. This choice 
brings compatibility with remote sensing data: the spatial scale is of the order of 
the resolution of satellite data and NDVI data can help discriminate between the 
different states of vegetation. What we here call stages of vegetation are similar 
to Plant Functional Types (PFTs), in the sense that trees are defined by their 
roles, which are played by different species in different parts of the tropics. 

A simulated year consists in the succession of two processes, modelled 
stochastically (figure 1): the evolution following the succession cycle with yearly 
probabilities to for each cell to switch to the next state and the fire propaga- 
tion in savanna. The choice for stochastic modelling is motivated by the need 
of restraining the numbers of parameters involved in processes poorly known in 
detail. 

During the succession processes, each cell is assigned a probability to turn into 
the next state in the succession cycle. These probabilities are related to biological 
processes and properties: rates of seed dispersal and sprouting, growth, mortal- 
ity... For the transitions from ‘ Young Pioneers' to ‘ Adult Pioneers' (growth) and 
from any stage to ‘ Herbs' (death), the probabilities of transition are uniformly 
distributed. The transitions from ‘ Herbs' to ‘ Young Pioneers' (recruitment) and 
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Fig. 1 . Synopsis of the processes modelled by FORSAT 



from ‘ Adult Pioneers' to 1 Forest’ (maturation) involve a spatially structured sub- 
process: seed dispersal. Each seed producer - represented by 1 Adult Pioneers' and 
1 Forest’ cells for the recruitment transition and by 1 Forest’ cells for the matura- 
tion transition - confers to other cells a probability of transition depending on 
their relative distances. The function relating the probability of successful seed 
dispersal events at the distance from the parent tree is termed seed shadow. 
Short-distance seed shadow is considered explicitly, whereas far seed dispersal is 
considered to confer an additional background uniform probability. 

Savanna fires are modelled by an iterative bond-percolation-type algorithm, 
similar to that used in [9]. Each cell in the direct neighborhood of a burning 
one is conferred a probability to be ignited, that depends on wind and on its 
vegetation stage (the four vegetation stages have decreasing susceptibility to 
burn) . 

Two parameters influence these processes: an environmental factor (modify- 
ing the transition probabilities and representing how environmental conditions 
as climate or soil facilitate the succession cycle) and the frequency of lightning 
of fires in savannas. 

2.2 Model Analysis 

The first step of the model analysis consisted in choosing realistic parameters 
(although not explicitly based on field studies) to check whether the processes 
chosen are able to reproduce qualitatively the real patterns of transgression. The 
initial state is a 180 x 200 cell landscape of forest savanna interface displaying 
two savanna gulfs. 

Then, we studied the quantitative effects of both parameters. For different fire 
frequencies, we investigated the equilibrium for varying environmental factors 
with two kinds of initial conditions: savanna (only ‘Herbs’) and forest (only 
‘ Adult Pioneers’). The other parameters 1 have been taken as simple as possible 
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Table 1. Summary of the parameters used for the different simulations. (1): parameters 
used for exploring the qualitative emergent behavior. (2): parameters used for the 
forest-savanna phase transition study. 



State 


Probability of transition 
to the next state 


Burning 

probability (1,2) 




(1) Global: 0.02 

Local: Gaussian probability 




‘Herbs' 


p = 0.1 • exp(— 0.1a: 2 ) 

up to 10 cells near seed cells 

(2) Global: 0.05 

Local: p = 0.1 next ‘ Adult Pioneers' 


1 


‘ Young Pioneers' 


Global: 0.2 


0.5 




(1) Local: 0.2 




‘ Adult Pioneers' 


next forest cells 
(2)- 


0.2 


‘ Forest ' 




0.005 


Other parameters: 


(1,2) Death probability: 0.005 

(1) Fire frequency: 0.5 

(1) Environmental factor: 1.0 





to investigate the phenomenological effects (pioneer and forest seed shadows 
restricted to the nearest neighbors) .In both case, the map contains 256 x 256 
cells, of which the 200 x 200 cell central region is considered to avoid edge effects. 

3 Emergent Behavior 

3.1 Phenomenological Behavior 

FOR.SAT proved able to reproduce the qualitative patterns of forest transgres- 
sion observed in field studies (figure 2): (i) the linear progression of the forest 
edge, (ii) the filling of savanna gulfs - that is a faster edge progression in the 
curved edges (iii) the creation of clusters of forest in savanna, (iv) their co- 
alescence or incorporation in the forest domain starting by a vegetation bridge 
between the two entities. 

Two aspects, more or less independent, need to be considered: the evolution 
of the forest edge and the evolution of clusters of forest trees in savanna. 

3.2 Savanna Phase or Forest Phase 

With the simplified model, two kinds of equilibrium can be reached: a forest, 
where the region is dominated by 1 Adult Pioneers' cells or a savanna, dominated 
by 1 Herbs' and 1 Young Pioneers' cells. In reference to the statistical physics 
theory, these two equilibria can be considered as two phases of the dynamical 
system. A given state can be characterized by the proportion of space occupied 
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Fig. 2. Phenomenological emergent behaviour: progression of forest edge, filling of 
savanna gulfs, nucleation, growth and incorporation of clusters. Zoom: bridge of vege- 
tation between the forest and the cluster, (maps: 180 x 200 cells) 




Are frequency fire frequency 



Fig. 3. Left: critical line in the parameters space separating the regions for which the 
equilibrium is forest or savanna. Right: phase transition diagram. For low fire frequen- 
cies, the transition is continuous: all tree covers can be reached. For more frequent 
fire frequencies, the transition is discontinuous: only very low (for low envorinmental 
factors) or very high tree covers (for high environmental factors) can be rached. 



by the largest cluster of ‘ Adult Pioneers’ cells: this order parameter (analogous 
to that for percolation systems) is near zero for the savanna phase (a contin- 
uous herbaceous layer with scattered trees) and positive for the forest phase 
(continuous tree cover possibly interspread with herbs). 

The two parameters control a van-der-Waals-like phase transition (figure 3) . 
In the parameter space, the critical line is increasing: the greater the environe- 
mental factor is, the greater the fire frequency must be to maintain a savanna. 
Moreover, the transition is continuous for low environmental factors/fire fre- 
quency but discontinuous for higher values. This discontinuity is characterized 
by an hysteresis: there is a range of parameters for which both savanna and 
forest can be stable. 





